SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: St George's Hospital Medical School 

(B) STREET: Cranmer Terrace 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): SW17 0RE

(ii) TITLE OF INVENTION: NOVEL POLYNUCLEOTIDES AND POLYPEPTIDES IN 
PATHOGENIC MYCOBACTERIA AND THEIR USE AS DIAGNOSTICS, 
VACCINES AND TARGETS FOR CHEMOTHERAPY 

(iii) NUMBER OF SEQUENCES: 41 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB96/03221 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GATCCAACTA AACCCGATGG AACCCCGCGC AAACTATTGG ACGTCTCCGC GCTACGCAGT 60

TGGGTTGGCG CCCGCGAATC GCACTGAAAG AGGGCATCGA TGCAACGGTG TCGTGGTACC 120

GCACAAATGC CGATGCCGTG AGGAGGTAAA GCTGCGGGCC GGCCGATGTT ATCCCTCCGG 180

CCGGACGGGT AGGGCGACCT GCCATCGAGT GGTACGGCAG TCGCCTGGCC GGCGAGGCGC 240

ATGGCCTATG TGAGTATCCC ATAGCCTGGC TTGGCTCGCC CCTACGCATT ATCAGTTGAC 300




CGCTTTCGCG CCACGTCGCA GGCTTGCGGC AGCATCCCGT TCAGGTCTCC TCATGGTCCG 360 

GTGTGGCACG ACCACGCAAG CTCGAACCGA CTCGTTTGCC AATTTCGCAT GCTAATATCG 420 

CTCGATGGAT TTTTTGCGCA ACGCCGGCTT GATGGCTCGT AACGTTAGCA CCGAGATGCT 480 

GCGCCACTCC GAACGAAAGC GCCTATTAGT AAACCAAGTC GAAGCATACG GAGTCAACGT 540 

TGTTATTGAT GTCGGTGCTA ACTCCGGCCA GTTCGGTAGC GCTTTGCGTC GTGCAGGATT 600 

CAAGAGCCGT ATCGTTTCCT TTGAACCTCT TTCGGGGCCA TTTGCGCAAC TAACGCGCAA 660 

GTCGGCATCG GATC 674 
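
Each sequence row in this listing carries six groups of ten residues followed by the running residue count. The sketch below is an illustrative check of that layout, not part of the filed listing; the function name `check_rows` and its `start` parameter are assumptions introduced here.

```python
def check_rows(rows, start=0):
    """Verify the running residue count printed at the end of each row.

    `rows` are listing lines such as "GTCGGCATCG GATC 674"; `start` is the
    count already reached before the first row. Returns the final count.
    """
    total = start
    for row in rows:
        # Every token except the last is a residue group; the last is the count.
        *groups, printed = row.split()
        total += sum(len(group) for group in groups)
        if total != int(printed):
            raise ValueError(f"row ends at {total}, listing says {printed}")
    return total


# Last row of SEQ ID NO: 1, which follows the row ending at position 660.
print(check_rows(["GTCGGCATCG GATC 674"], start=660))
```

A mismatch raises `ValueError`, which is one way to spot a dropped or duplicated group after extraction.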
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GATCCGATGC CGACTTGCGC GTTAGTTGCG CAAATGGCCC CGAAAGAGGT TCAAAGGAAA 60 

CGATACGGCT CTTGAATCCT GCACGACGCA AAGCGCTACC GAACTGGCCG GAGTTAGCAC 120 

CGACATCAAT AACAACGTTG ACTCCGTATG CTTCGACTTG GTTTACTAAT AGGCGCTTTC 180 

GTTCGGAGTG GCGCAGCATC TCGGTGCTAA CGTTACGAGC CATCAAGCCG GCGTTGCGCA 240 

AAAAATCCAT CGAGCGATAT TAGCATGCGA AATTGGGAAA CGAGTCGGTT CGAGCTTGCG 300 

TGGTCGTGCC ACACCGGACC ATGAGGAGAC CTGAACGGGA TGCTGCCGCA AGCCTGCGAC 360 

GTGGCGCGAA AGCGGTCAAC TGATAATGCG TAGGGGCGAG CCAAGCCAGG CTATGGGATA 420 

CTCACATAGG CCATGCGCCT CGCCGGCCAG GCGACTGCCG TACCACTCGA TGGCAGGTCG 480 

CCCTACCCGT CCGGCCGGAG GGATAACATC GGCCGGCCCG CAGCTTTACC TCCTCACGGC 540 

ATCGGCATTT GTGCGGTACC ACGACACCGT TGCATCGATG CCCTCTTTCA GTGCGATTCG 600 

CGGGCGCCAA CCCAACTGCG TAGCGCGGAG ACGTCCAATA GTTTGCGCGG GGTTCCATCG 660 




GGTTTAGTTG GATC 674

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCTGGG TTGGAGACGA CGTCGAACTC CTGGTCGGTC TTGCTTCGAA TGATCGCTGT 60

GATCTGGTCG GCGGTGCCGA CAGGAACCGT CGACTTGTCG ACGATCACCT TGTACCGGTC 120

GATGTATGAC CCAATGTCGT CCGCAACCGA GAAGACGTAC GTCAGGTCCG CCGCCCCGCT 180

TTCACCCATG GGCGTCGGGA CGGCGATGAA AATGACGTCC GCGTGCTCGA TTCCGCGTTG 240

CCGGTCGGTG GTGAAGTCAA TCAGCCCGTT CTCACGGTTC CTCGCAATCA ACTCCCAACC 300

CGGGCTCGAA AATCGGGACA CTGCCTGCGA GGAGCAAATC GATCTTGGCC TGATCGATAT 360

CGACACAGAC GACATCGTTG CCGCTATCCG CGAGACAGGC GCCCGTGACG AGGCCTACAT 420

AGCCTGATCC GACCACCGAA ATTTTCAAGA TGACCCCTTC AAGTCCCCGA TCGGTCGACG 480

ACCATACTGC CGCAACTCTG TACCCTCCGT GGGTAATTCG CATGTCGCGT TCGTAAGGAG 540

CAGCCAGCGA GTCGGGGACG TTCGGTGAGA GAGTCGCAGG ACTACGAGGT TGCCGGTGCG 600

ATACATCACA GTGTTGCGTC TGTCGGCAAC GATGCAGCAA GAACCCACGG GGCAGCCCTG 660

AACTGCGCGC ATGACCGGTC CTTGTCCTGG CACCTTTGAT CGGCCACCGC TTCCATGCGA 720

ACATGACCGG AATCCATAGC GCGTGGTCAA GCAGCGGGGA GGTAGACGTC GGTGTCATCT 780

GCTCCAACCG TGTCGGTGAT AACGATTTCG CTGAACGATC TCGAGGGATT GAAAAGCACC 840

GTGGAGAGCG TTCGCGCGCA GCGCTATGGG GGGCGAATCG AGCACATCGT CATCGACGGT 900

GGATCGGGCG ACGCCGTCGT GGAGTATCTG TCCGGCGATC CTGGCTTTGC ATATTGGCAA 960

TCTCAGCCCG ACAACGGGAG ATATGACGCG ATGAATCAGG GCATTGCCCA TTCGTCGGGC 1020



GACCTGTTGT GGTTTATGCA CTCCACGGAT CGTTTCTCCG ATCCAGATGC AGTCGCTTCC 1080 

GTGGTGGAGG CGCTCTCGGG GCATGGACCA GTACGTGATT TGTGGGGTTA CGGGAAAAAC 1140 

AACCTTGTCG GACTCGACGG CAAACCACTT TTCCCTCGGC CGTACGGCTA TATGCCGTTT 1200 

AAGATGCGGA AATTTCTGCT CGGCGCGACG GTTGCGCATC AGGCGACATT CTTCGGCGCG 1260 

TCGCTGGTAG CCAAGTTGGG CGGTTACGAT CTTGATTTTG GACTCGAGGC GGACCAGCTG 1320 

TTCATCTACC GTGCCGCACT AATACGGCCT CCCGTCACGA TCGACCGCGT GGTTTGCGAC 1380 

TTCGATGTCA CGGGACCTGG TTCAACCCAG CCCATCCGTG AGCACTATCG GACCCTGCGG 1440 

CGGCTCTGGG ACCTGCATGG CGACTACCCG CTGGGTGGGC GCAGAGTGTC GTGGGCTTAC 1500 

TTGCGTGTGA AGGAGTACTT GATTCGGGCC GACCTGGCCG CATTCAACGC GGTAAAGTTC 1560 

TTGCGAGCGA AGTTCGCCAG AGCTTCGCGG AAGCAAAATT CATAGAAACC AACTTCTACT 1620 

GCCTGACCTG AGCAGCGCCG AGGCGCGCAG CGCGATCAGT GCGACCTGAA CGGCCAGGTG 1680 

GAAAGCGCCA CCGATCCCGG CACCGAGTGC CTGACGCTTC GGATCCCTTG CACCACAACG 1740 

AGAGTGAGAG CGCCATGATG AGGAAATATC GGCTGGGCGG AGTCAACGCC GGAGTGACAA 1800 

AAGTGAGAAC CCGGTGAAGC GAGCGCTTAT AACAGGGATC ACGGGGCAGG ATGGTTCCTA 1860 

CCTCGCCGAG CTACTACTGA GCAAGGGATA CGAGGTTCAC GGGCTCGTTC GTCGAGCTTC 1920 

GACGTTTAAC ACGTCGCGGA TCGATCACCT CTACGTTGAC CCACACCAAC CGGGCGCGCG 1980 

CTTGTTCTTG CACTATGCAG ACCTCACTGA CGGCACCCGG TTGGTGACCC TGCTCAGCAG 2040 

TATCGACCCG GATGAGGTCT ACAACCTCGC AGCGCAGTCC CATGTGCGCG TCAGCTTTGA 2100 

CGAGCCAGTG CATACCGGAG ACACCACCGG CATGGGATCG ATCCGACTTC TGGAAGCAGT 2160 

CCGCCTTTCT CGGGTGGACT GCCGGTTCTA TCAGGCTTCC TCGTCGGAGA TGTTCGGCGC 2220 

ATCTCCGCCA CCGCAGAACG AATCGACGCC GTTCTATCCC CGTTCGCCAT ACGGCGCGGC 2280 

CAAGGTCTTC TCGTACTGGA CGACTCGCAA CTATCGAGAG GCGTACGGAT TATTCGCAGT 2340 

GAATGGCATC TTGTTCAACC ATGAGTCCCC CCGGCGCGGC GAGACTTTCG TGACCCGAAA 2400 

GATCACGCGT GCCGTGGCGC GCATCCGAGC TGGCGTCCAA TCGGAGGTCT ATATGGGCAA 2460 




CCTCGATGCG ATCCGCGACT GGGGCTACGC GCCCGAATAT GTCGAGGGGA TGTGGAGGAT 2520

GTTGCAAGCG CCTGAACCTG ATGACTACGT CCTGGCGACA GGGCGTGGTT ACACCGTACG 2580

TGAGTTCGCT CAAGCTGCTT TTGACCATGT CGGGCTCGAC TGGCAAAAGC GCGTCAAGTT 2640

TGACGACCGC TATTTGCGTC CCACCGAGGT CGATTCGCTA GTAGGAGATG CCGACAAGGC 2700

GGCCCAGTCA CTCGGCTGGA AAGCTTCGGT TCATACTGGT GAACTCGCGC GCATCATGGT 2760

GGACGCGGAC ATCGCCGCGT TGGAGTGCGA TGGCACACCA TGGATCGACA CGCCGATGTT 2820

GCCTGGTTGG GGCAGAGTAA GTTGACGACT ACACCTGGGC CTCTGGACCG CGCAACGCCC 2880

GTGTATATCG CCGGTCATCG GGGGCTGGTC GGCTCAGCGC TCGTACGTAG ATTTGAGGCC 2940

GAGGGGTTCA CCAATCTCAT TGTGCGATCA CGCGATGAGA TTGATCTGAC GGACCGAGCC 3000

GCAACGTTTG ATTTTGTGTC TGAGACAAGA CCACAGGTGA TCATCGATGC GGCCGCACGG 3060

GTCGGCGGCA TCATGGCGAA TAACACCTAT CCCGCGGACT TCTTGTCCGA AAACCTCCGA 3120

ATCCAGACCA ATTTGCTCGA CGCAGCTGTC GCCGTGCGTG TGCCGCGGCT CCTTTTCCTC 3180

GGTTCGTCAT GCATCTACCC GAAGTACGCT CCGCAACCTA TCCACGAGAG TGCTTTATTG 3240

ACTGGCCCTT TGGAGCCCAC CAACGACGCG TATGCGATCG CCAAGATCGC CGGTATCCTG 3300

CAAGTTCAGG CGGTTAGGCG CCAATATGGG CTGGCGTGGA TCTCTGCGAT GCCGACTAAC 3360

CTCTACGGAC CCGGCGACAA CTTCTCCCCG TCCGGGTCGC ATCTCTTGCC GGCGCTCATC 3420

CGTCGATATG AGGAAGCCAA AGCTGGTGGT GCAGAAGAGG TGACGAATTG GGGGACCGGT 3480

ACTCCGCGGC GCGAACTTCT GCATGTCGAC GATCTGGCGA GCGCATGCCT GTTCCTTTTG 3540

GAACATTTCG ATGGTCCGAA CCACGTCAAC GTGGGCACCG GCGTCGATCA CAGCATTAGC 3600

GAGATCGCAG ACATGGTCGC TACAGCGGTG GGCTACATCG GCGAAACACG TTGGGATCCA 3660

ACTAAACCCG ATGGAACCCC GCGCAAACTA TTGGACGTCT CCGCGCTACG CGAGTTGGGT 3720

TGGCGCCCGC GAATCGCACT GAAAGACGGC ATCGATGCAA CGGTGTCGTG GTACCGCACA 3780

AATGCCGATG CCGTGAGGAG GTAAAGCTGC GGGTCGGCCG ATGTTATCCC TCCGGCCGGA 3840

CGGGTGGGGC GACCTGCCGT CGAGTGGTAC GGCAGTCGCC TGGCCGGCGA GGCGCGTGGC 3900





CTATGGGAGT ATCCAATAGC CTGGCTTGGC TCGCCCCTAC GCATTATCAG TTGACCGCTT 3960

TCGCGCCAGC TCGCAGGCTT GCGGCAGCAT CCCGTTCAGG TCTCCTCATG GTCCGGTGTG 4020

GCACGACCAC GCAAGCTCGA ACCGACTCGT TTCCCAATTT CGCATGCTAA TATCGCTCGA 4080

TGGATTTTTT GCGCAACGCC GGCTTGATGG CTCGTAACGT TAGTACCGAG ATGCTGCGCC 4140

ACTTCGAACG AAAGCGCCTA TTAGTAAACC AATTCAAAGC ATACGGAGTC AACGTTGTTA 4200

TTGATGTCGG TGCTAACTCC GGCCAGTTCG GTAGCGCTTT GCGTCGTGCA GGATTCAAGA 4260

GCCGTATCGT TTCCTTTGAA CCTCTTTCGG GGCCATTTGC GCAACTAACG CGCAAGTCGG 4320

CATCGGATCC ACTATGGGAG TGTCACCAGT ATGCCCTAGG CGACGCCGAT GAGACGATTA 4380

CCATCAATGT GGCAGGCAAT GCGGGGGCAA GTAGTTCCGT GCTGCCGATG CTTAAAAGTC 4440

ATCAAGATGC CTTTCCTCCC GCGAATTATA TTGGCACCGA AGACGTTGCA ATACACCGCC 4500

TTGATTCGGT TGCATCAGAA TTTCTGAACC CTACCGATGT TACTTTCCTG AAGATCGACG 4560

TACAGGGTTT CGAGAAGCAG GTTATCACGG GCAGTAAGTC AACGCTTAAC GAAAGCTGCG 4620

TCGGCATGCA ACTCGAACTT TCTTTTATTC CGTTGTACGA AGGTGACATG CTGATTCATG 4680

AAGCGCTTGA ACTTGTCTAT TCCCTAGGTT TCAGACTGAC GGGTTTGTTG CCCGGCTTTA 4740

CGGATCCGCG CAATGGTCGA ATGCTTCAAG CTGACGGCAT TTTCTTCCGT GGGGACGATT 4800

GACATAAATG CTCCGTCGGC ACCCTGCCGG TATCCAAACG GGCGATCTGG TGAGCCGGCC 4860

TCCCGGGCAC CTAATCGACT ATCTAAATTG AGGCGGCCGC GACGTGCGGC ACGAACAGGT 4920

GGCCGGCTGC TAGCGTTACA CACGTCATGA CTGCGCCAGT GTTCTCGATA ATTATCCCTA 4980

CCTTCAATGC AGCGGTGACG CTGCAAGCCT GCCTCGGAAG CATCGTCGGG CAGACCTACC 5040

GGGAAGTGGA AGTGGTCCTT GTCGACGGCG GTTCGACCGA TCGGACCCTC GACATCGCGA 5100

ACAGTTTCCG CCCGGAACTC GGCTCGCGAC TGGTCGTTCA CAGCGGGCCC GATGATGGCC 5160

CCTACGACGC CATGAACCGC GGCGTCGGCG TGGCCACAGG CGAATGGGTA CTTTTTTTAG 5220

GCGCCGACGA CACCCTCTAC GAACCAACCA CGTTGGCCCA GGTAGCCGCT TTTCTCGGCG 5280

ACCATGCGGC AAGCCATCTT GTCTATGGCG ATGTTGTGAT GCGTTCGACG AAAAGCCGGC 5340




ATGCCGGACC TTTCGACCTC GACCGCCTCC TATTTGAGAC GAATTTGTGC CACCAATCGA 5400 

TCTTTTACCG CCGTGAGCTT TTCGACGGCA TCGGCCCTTA CAACCTGCGC TACCGAGTCT 5460 

GGGCGGACTG GGACTTCAAT ATTCGCTGCT TCTCCAACCC GGCGCTGATT ACCCGCTACA 5520 

TGGACGTCGT GATTTCCGAA TACAACGACA TGACCGGCTT CAGCATGAGG CAGGGGACTG 5580 

ATAAAGAGTT CAGAAAACGG CTGCCAATGT ACTTCTGGGT TGCAGGGTGG GAGACTTGCA 5640 

GGCGCATGCT GGCGTTTTTG AAAGACAAGG AGAATCGCCG TCTGGCCTTG CGTACGCGGT 5700 

TGATAAGGGT TAAGGCCGTC TCCAAAGAAC GAAGCGCAGA ACCGTAGTCG CGGATCCACA 5760 

TTGGACTTCT TTAACGCGTT TGCGTCCTGA TCCACCTTTC AAGCCCGTTC CGCGTAACGC 5820 

GGCGCGCAGA GAGTGGTCGC ATATCGCATC ACTGTTCTCG TGCCAGTGCT TGGAAAGCGT 5880 

CGAGCACTCT GGTTCGCGTT CTTGACGTTC GCGCCCGCTC CTAGAGGTAG CGTGTCACGT 5940 

GACTGAAGCC AATGAGTGCA ACTCGGCGTC GCGAAAGGTT TCAGTCGCGG TTGAGCAAGA 6000 

CACCGCAAGA CTACTGGAGT GCGTGCACAA GCGCCTCCAG CTCGCGGCTG AAAGCGGATG 6060 

CAAAGGGATT CGAAGCTTGA GCAACATGCG AAGGGGAGAA CGGCCTATGA GGCTGGGACA 6120 

GGTTTTCGAT CCGCGCGCGA ATGCACTGTC AATGGCCAAG TAGAAGTCCC CGCTGGTGGC 6180 

CAGCAGAAGT CCCCACTCCG CTGCGGGTGG TTGGCTAATT CTTGGCGGCT CCCTTCTTGT 6240 

GGTCGGCGTG GCGCATCCGG TAGGACTCGC CGGAGGTGAC GACGATGCTG GCGTGGTGCA 6300 

GCAGCCGATC GAGGATGCTG GCGGCGGTGG TGTGCTCGGG CAGGAATCGC CCCCATTGTT 6360 

CGAAGGGCCA ATGCGAGGCG ATGGCCAGGG AGCGGCGCTC GTAGCCGGCA GCCACGAGCC 6420 

GGAACAACAG TTGAGTCCCG GTGTCGTCGA GCGGGGCGAA GCCGATCTCG TCCAAGATGA 6480 

CCAGATCCGC GCGGAGCAGG GTGTCGATGA TCTTGCCGAC GGTGTTGTCG GCCAGGCCGC 6540 

GGTAGAGGAC CTCGATCAGG TCGGCGGCGG TGAAGTAGCG GACTTTGAAT CCGGCGTGGA 6600 

CGGCAGCGTG CCCGCAGCCG ATGAGCAGGT GACTTTTGCC CGTACCAGGT GGGCCAATGA 6660 

CCGCCAGGTT CTGTTGTGCC CGAATCCATT CCAGGCTCGA CAGGTAGTCG AACGTGGCTG 6720 

CGGTGATCGA CGATCCGGTG ACGTCGAACC CGTCGAGGGT CTTGGTGACC GGGAAGGCTG 6780 






CGGCCTTGAG ACGGTTGGCG GTGTTGGAGG CATCGCGGGC AGCGATCTCG GCCTCAACCA 6840 

ACGTCCGCAG GATCTCCTCC GGTGTCCAGC GTTGCGTCTT GGCGACTTGC AACACCTCGG 6900 

CGGCGTTGCG GCGCACCGTG GCCAGCTTCA ACCGCCGCAG CGCCGCGTCA AGGTCAGCAG 6960 

CCAGCGGTGC CGCCGAGGAC GGTGCCACCG GCTTGGCAGC GGTGGTCATG AGGCCGTCCC 7020 

GTCGGTGGTG TTGATCTTGT AGGCCTCCAA CGAGCGGGTC TCGACGGTGG GCAGATCGAG 7080 

CACGAGTGCG TCGCCGGCGG GGCGGGGTTG TGGGGTGCCG GCGCCGGCGG CCAGGATCGA 7140 

GCGCACGTCG GCAGCGCGGA ACCGGCGAAA CGCAACCGCC CGGCGCAGCG CGTCAATCAA 7200 

AGCCTGTTCG CCGTGGGCGG CGCCAAGGCC GAGCAGAATG TCGAGTTCGG ATTTCAGTCG 7260 

GGTGTTGCCG ATCGCAGCAG CACCGACGAG GAACTGCTGC GCTTCGGTTC CCAATGCGCA 7320 

GAATCGTTTC TCTGCTTGGG TTTTCGGGCG AGGACCACGC GAGGGTGCGG GTCTGGGTCC 7380 

GTCGTAGTGT TCATCGAGGA TGGACACCTC ACCTGGGCTG ACGAGCTCGT GCTCGGCCAC 7440 

GATCACACCG GTCGCAGGTT CCAACAGGAT CAGGGCGCCA TGATCGACCA CCACCGCCAC 7500 

GGTGGCACCG ACGAGCCGCT GAGGCACCGA GTAACGAGCT GAGCCGTAAC GGATGCACGA 7560 

GAGGCCGTCG ACCTTACGGC GCACCGACCC CGAGCCGATC GTCGGCCGCA GCGAGGGCAG 7620 

CTCCCTCAAG ACGGTGCGCT CGTCAACCAA GCGATCGTTG GGCACGGCGC AGATCTCCGA 7680 

GTGGACCGTG GCATTGACCT CGGCGCACCA TAGTTGCGCC TGGGCGTTGA GGGCACGTAG 7740 

GTCGACCTGC TCACCGGCTA ACGCAGCTTC GGTCAGCAGC GGCACCGCAA GGTCGTCCTG 7800 

AGCGTAGCCA CAGAGGTTCT CCACGATGCC CTTCGATTGC GGATCCGCAC CGTGGCAGAA 7860 

GTCCGGAACG AAGCCATAGT GGGACGCGAA TCGCACATAA TCCGGTGTTG GAACAACAAC 7920 

ATTGGCGACG ACACCACCTT TGAGGCAGCC CATCCGGTCG GCCAGGATCT TGGCCGGAAC 7980 



CCCACCGATC GCCTC 7995



(2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4435 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TTCTACTGCC TGACCTGAGC AGCGCCGAGG CGCGCAGCGC GATCACTGCG ACCTGAATGG 60

CCAGGTGGAA AGCGCCACCG ATCCCGGCAC CGAGTGCCTG ACGATTCGGA TCCCTTGCAC 120

CACAACGAGA GTGAGACCGC CATGATGACG AAATATCGGC TGGGCGGAGT CAACGCCGGA 180

GTGACAAAAG TGAGAACCCG GTGAAGCGAG CGCTTATAAC AGGGATCACG GGGCAGGATG 240

GTTCCTACCT CGCCGAGCTA CTACTGAGCA AGGGATACGA GGTTCACGGG CTCGTTCGTC 300

GAGCTTCGAC GTTTAACACG TCGCGGATCG ATCACCTCTA CGTTGACCCA CACCAACCGG 360

GCGCGCGCTT GTTCTTGCAC TATGCAGACC TCACTGACGG CACCCGGTTG GTGACCCTGC 420

TCAGCAGTAT CGACCCGGAT GAGGTCTACA ACCTCGCAGC GCAGTCCCAT GTGCGCGTCA 480

GCTTTGACGA GCCAGTGCAT ACCGGAGACA CCACCGGCAT GGGATCGATC CGACTTCTGG 540

AAGCAGTCCG CCTTTCTCGG GTGGACTGCC GGTTCTATCA GGCTTCCTCG TCGGAGATGT 600

TCGGCGCATC TCCGCCACCG CAGAACGAAT CGACGCCGTT CTATCCCCGT TCGCCATACG 660

GCGCGGCCAA GGTCTTCTCG TACTGGACGA CTCGCAACTA TCGAGAGGCG TACGGATTAT 720

TCGCAGTGAA TGGCATCTTG TTCAACCATG AGTCCCCCCG GCGCGGCGAG ACTTTCGTGA 780

CCCGAAAGAT CACGCGTGCC GTGGCGCGCA TCCGAGCTGG CTGCCAATCG GAGGTCTATA 840

TGGGCAACCT CGATGCGATC CGCGACTGGG GCTACGCGCC CGAATATGTC GAGGGGATGT 900

GGAGGATGTT GCAAGCGCCT GAACCTGATG ACTACGTCCT GGCGACAGGG CGTGGTTACA 960

CCGTACGTGA GTTCGCTCAA GCTGCTTTTG ACCACGTCGG GCTCGACTGG CAAAAGCACG 1020

TCAAGTTTGA CGACCGCTAT TTGCGCCCCA CCGAGGTCGA TTCGCTAGTA GGAGATGCCG 1080

ACAGGGCGGC CCAGTCACTC GGCTGGAAAG CTTCGGTTCA TACTGGTGAA CTCGCGCGCA 1140

TCATGGTGGA CGCGGACATC GCCGCGTCGG AGTGCGATGG CACACCATGG ATCGACACGC 1200

CGATGTTGCC TGGTTGGGGC GGAGTAAGTT GACGACTACA CCTGGGCCTC TGGACCGCGC 1260

AACGCCCGTG TATATCGCCG GTCATCGGGG GCTGGTCGGC TCAGCGCTCG TACGTAGATT 1320

TGAGGCCGAG GGGTTCACCA ATCTCATTGT GCGATCACGC GATGAGATTG ATCTGACGGA 1380

CCGAGCCGCA ACGTTTGATT TTGTGTCTGA GACAAGACCA CAGGTGATCA TCGATGCGGC 1440

CGCACGGGTC GGCGGCATCA TGGCGAATAA CACCTATCCC GCGGACTTCT TGTCCGAAAA 1500

CCTCCGAATC CAGACCAATT TGCTCGACGC AGCTGTCGCC GTGCGTGTGC CGCGGCTCCT 1560

TTTCCTCGGT TCGTCATGCA TCTACCCGAA GTACGCTCCG CAACCTATCC ACGAGAGTGC 1620

TTTATTGACT GGCCCTTTGG AGCCCACCAA CGACGCGTAT GCGATCGCCA AGATCGCCGG 1680

TATCCTGCAA GTTCAGGCGG TTAGGCGCCA ATATGGGCTG GCGTGGATCT CTGCGATGCC 1740

GACTAACCTC TACGGACCCG GCGACAACTT CTCCCCGTCC GGGTCGCATC TCTTGCCGGC 1800

GCTCATCCGT CGATATGAGG AAGCCAAAGC TGGTGGTGCA GAAGAGGTGA CGAATTGGGG 1860

GACCGGTACT CCGCGGCGCG AACTTCTGCA TGTCGACGAT CTGGCGAGCG CATGCCTGTT 1920

CCTTTTGGAA CATTTCGATG GTCCGAACCA CGTCAACGTG GGCACCGGCG TCGATCACAG 1980

CATTAGCGAG ATCGCAGACA TGGTCGCTAC GGCGGTGGGC TACATCGGCG AAACACGTTG 2040

GGATCCAACT AAACCCGATG GAACCCCGCG CAAACTATTG GACGTCTCCG CGCTACGCGA 2100

GTTGGGTTGG CGCCCGCGAA TCGCACTGAA AGACGGCATC GATGCAACGG TGTCGTGGTA 2160

CCGCACAAAT GCCGATGCCG TGAGGAGGTA AAGCTGCGGG CCGGCCGATG TTATCCCTCC 2220

GGCCGGACGG GTAGGGCGAC CTGCCATCGA GTGGTACGGC AGTCGCCTGG CCGGCGAGGC 2280

GCATGGCCTA TGGGAGTATC CCATAGCCTG GCTTGGCTCG CCCCTACGCA TTATCAGTTG 2340

ACCGCTTTCG CGCCAGCTCG CAGGCTCGCG GCAGCATCCC GTTCAGGTCT CCTCATGGTC 2400

CGGTGTGGCA CGACCACGCA AGCTCGAACC GACTCGTTTC CCAATTTCGC ATGCTAATAT 2460

CGCTCGATGG ATTTTTTGCG CAACGCCGGC TTGATGGCTC GTAACGTTAG CACCGAGATG 2520

CTGCGCCACT TCGAACGAAA GCGCCTATTA GTAAACCAAT TCAAAGCATA CGGAGTCAAC 2580

GTTGTTATTG ATGTCGGTGC TAACTCCGGC CAGTTCGGTA GCGCTTTGCG TCGTGCAGGA 2640

TTCAAGAGCC GTATCGTTTC CTTTGAACCT CTTTCGGGGC CATTTGCGCA ACTAACGCGC 2700

GAGTCGGCAT CGGATCCACT ATGGGAGTGT CACCAGTATG CCCTAGGCGA CGCCGATGAG 2760

ACGATTACCA TCAATGTGGC AGGCAATGCG GGGGCAAGTA GTTCCGTGCT GCCGATGCTT 2820

AAAAGTCATC AAGATGCCTT TCCTCCCGCG AATTATATTG GCACCGAAGA CGTTGCAATA 2880

CACCGCCTTG ATTCGGTTGC ATCAGAATTT CTGAACCCTA CCGATGTTAC TTTCCTGAAG 2940

ATCGACGTAC AGGGTTTCGA GAAGCAGGTT ATCGCGGGCA GTAAGTCAAC GCTTAACGAA 3000

AGCTGCGTCG GCATGCAACT CGAACTTTCT TTTATTCCGT TGTACGAAGG TGACATGCTG 3060

ATTCATGAAG CGCTTGAACT TGTCTATTCC CTAGGTTTCA GACTGACGGG TTTGTTGCCC 3120

GGATTTACGG ATCCGCGCAA TGGTCGAATG CTTCAAGCTG ACGGCATTTT CTTCCGTGGG 3180

GACGATTGAC ATAAATGCTT GCGTCGGCAC CCTGCCGGTA TCCAAACGGG CGATCTGGTG 3240

AGCCGGCCTC CCGGGCACCT AATCGACTAT CTAAATTGAG GCGGCCGCGA CGTGCGGCAC 3300

GAACAGGTGG CCGGCTGCTA GCGTTACACA CGTCATGACT GCGCCAGTGT TCTCGATAAT 3360

TATCCCTACC TTCAATGCAG CGGTGACGCT GCAAGCCTGC CTCGGAAGCA TCGTCGGGCA 3420

GACCTACCGG GAAGTGGAAG TGGTCCTTGT CGACGGCGGT TCGACCGATC GGACCCTCGA 3480

CATCGCGAAC AGTTTCCGCC CGGAACTCGG CTCGCGACTG GTCGTTCACA GCGGGCCCGA 3540

TGATGGCCCC TACGACGCCA TGAACCGCGG CGTCGGCGTA GCCACAGGCG AATGGGTACT 3600

TTTTTTAGGC GCCGACGACA CCCTCTACGA ACCAACCACG TTGGCCCAGG TAGCCGCTTT 3660

TCTCGGCGAC CATGCGGCAA GCCATCTTGT CTATGGCGAT GTTGTGATGC GTTCGACGAA 3720

AAGCCGGCAT GCCGGACCTT TCGACCTCGA CCGCCTCCTA TTTGAGACGA ATTTGTGCCA 3780

CCAATCGATC TTTTACCGCC GTGAGCTTTT CGACGGCATC GGCCCTTACA ACCTGCGCTA 3840

CCGAGTCTGG GCGGACTGGG ACTTCAATAT TCGCTGCTTC TCCAACCCGG CGCTGATTAC 3900

CCGCTACATG GACGTCGTGA TTTCCGAATA CAACGACATG ACCGGCTTCA GCATGAGGCA 3960

GGGGACTGAT AAAGAGTTCA GAAAACGGCT GCCAATGTAC TTCTGGGTTG CAGGGTGGGA 4020

GACTTGCAGG CGCATGCTGG CGTTTTTGAA AGACAAGGAG AATCGCCGTC TGGCCTTGCG 4080

TACGCGGTTG ATAAGGGTTA AGGCCGTCTC CAAAGAACGA AGCGCAGAAC CGTAGTCGCG 4140

GATCCACATT GGACTTCTTT AACGCGTTTG CGTCCTGATC CACCTTTCAA CCCCGTTCCG 4200

CGTGACGCGG CGCGCAGAGA GTGGTCGCAT ATCGCGTCAC TGTTCTCGTG CCAGTGCTTG 4260

GAAAGCGTCG AGCACTCTGG TTCGCGTTCT TGACGTTCGC GCCCGCCCCT AGAGGTAGCG 4320

TGTCACGTGA CTGAAGCCAA TGAGTGCAAC TCGGCGTCGC GAAAGGTTTC AGTCGCGGTT 4380

GAGCAAGACA CCGCAAGACT ACTGGAGTGC GTGCACAAGC GCCTCCAGCT CACGG 4435

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..375

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG ATC GCT GTG ATC TGG TCG GCG GTG CCG ACA GGA ACC GTC GAC TTG 48
Met Ile Ala Val Ile Trp Ser Ala Val Pro Thr Gly Thr Val Asp Leu
1 5 10 15

TCG ACG ATC ACC TTG TAC CGG TCG ATG TAT GAC CCA ATG TCG TCC GCA 96
Ser Thr Ile Thr Leu Tyr Arg Ser Met Tyr Asp Pro Met Ser Ser Ala
20 25 30

ACC GAG AAG ACG TAC GTC AGG TCC GCC GCC CCG CTT TCA CCC ATG GGC 144
Thr Glu Lys Thr Tyr Val Arg Ser Ala Ala Pro Leu Ser Pro Met Gly
35 40 45

GTC GGG ACG GCG ATG AAA ATG ACG TCC GCG TGC TCG ATT CCG CGT TGC 192
Val Gly Thr Ala Met Lys Met Thr Ser Ala Cys Ser Ile Pro Arg Cys
50 55 60

CGG TCG GTG GTG AAG TCA ATC AGC CCG TTC TCA CGG TTC CTC GCA ATC 240
Arg Ser Val Val Lys Ser Ile Ser Pro Phe Ser Arg Phe Leu Ala Ile
65 70 75 80

AAC TCC CAA CCC GGG CTC GAA AAT CGG GAC ACT GCC TGC GAG GAG CAA 288
Asn Ser Gln Pro Gly Leu Glu Asn Arg Asp Thr Ala Cys Glu Glu Gln
85 90 95

ATC GAT CTT GGC CTG ATC GAT ATC GAC ACA GAC GAC ATC GTT GCC GCT 336
Ile Asp Leu Gly Leu Ile Asp Ile Asp Thr Asp Asp Ile Val Ala Ala
100 105 110

ATC CGC GAG ACA GGC GCC CGT GAC GAG GCC TAC ATA GCC TGA 378
Ile Arg Glu Thr Gly Ala Arg Asp Glu Ala Tyr Ile Ala
115 120 125



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ile Ala Val Ile Trp Ser Ala Val Pro Thr Gly Thr Val Asp Leu
1 5 10 15

Ser Thr Ile Thr Leu Tyr Arg Ser Met Tyr Asp Pro Met Ser Ser Ala
20 25 30

Thr Glu Lys Thr Tyr Val Arg Ser Ala Ala Pro Leu Ser Pro Met Gly
35 40 45

Val Gly Thr Ala Met Lys Met Thr Ser Ala Cys Ser Ile Pro Arg Cys
50 55 60

Arg Ser Val Val Lys Ser Ile Ser Pro Phe Ser Arg Phe Leu Ala Ile
65 70 75 80

Asn Ser Gln Pro Gly Leu Glu Asn Arg Asp Thr Ala Cys Glu Glu Gln
85 90 95

Ile Asp Leu Gly Leu Ile Asp Ile Asp Thr Asp Asp Ile Val Ala Ala
100 105 110

Ile Arg Glu Thr Gly Ala Arg Asp Glu Ala Tyr Ile Ala
115 120 125
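
The residues of SEQ ID NO: 6 correspond codon by codon to the CDS annotated at positions 1..375 of SEQ ID NO: 5. The sketch below illustrates that correspondence. It assumes the standard genetic code (codon order T, C, A, G) and reads every codon literally, so an initial GTG is reported as Val, as in the listings for SEQ ID NOS: 7 and 9. The names `CODON_TABLE`, `THREE_LETTER` and `translate` are introduced here for illustration and are not part of the listing.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order ('*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            break
        residues.append(THREE_LETTER[amino])
    return residues


# First five codons of SEQ ID NO: 5.
print(" ".join(translate("ATG ATC GCT GTG ATC")))
```

Comparing the output of `translate` against the printed three-letter codes is a quick way to catch character-recognition errors in the residue lines.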

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 834 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..831

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GTG TCA TCT GCT CCA ACC GTG TCG GTG ATA ACG ATT TCG CTG AAC GAT 48
Val Ser Ser Ala Pro Thr Val Ser Val Ile Thr Ile Ser Leu Asn Asp
130 135 140

CTC GAG GGA TTG AAA AGC ACC GTG GAG AGC GTT CGC GCG CAG CGC TAT 96
Leu Glu Gly Leu Lys Ser Thr Val Glu Ser Val Arg Ala Gln Arg Tyr
145 150 155

GGG GGG CGA ATC GAG CAC ATC GTC ATC GAC GGT GGA TCG GGC GAC GCC 144
Gly Gly Arg Ile Glu His Ile Val Ile Asp Gly Gly Ser Gly Asp Ala
160 165 170

GTC GTG GAG TAT CTG TCC GGC GAT CCT GGC TTT GCA TAT TGG CAA TCT 192
Val Val Glu Tyr Leu Ser Gly Asp Pro Gly Phe Ala Tyr Trp Gln Ser
175 180 185

CAG CCC GAC AAC GGG AGA TAT GAC GCG ATG AAT CAG GGC ATT GCC CAT 240
Gln Pro Asp Asn Gly Arg Tyr Asp Ala Met Asn Gln Gly Ile Ala His
190 195 200 205

TCG TCG GGC GAC CTG TTG TGG TTT ATG CAC TCC ACG GAT CGT TTC TCC 288
Ser Ser Gly Asp Leu Leu Trp Phe Met His Ser Thr Asp Arg Phe Ser
210 215 220

GAT CCA GAT GCA GTC GCT TCC GTG GTG GAG GCG CTC TCG GGG CAT GGA 336
Asp Pro Asp Ala Val Ala Ser Val Val Glu Ala Leu Ser Gly His Gly
225 230 235

CCA GTA CGT GAT TTG TGG GGT TAC GGG AAA AAC AAC CTT GTC GGA CTC 384
Pro Val Arg Asp Leu Trp Gly Tyr Gly Lys Asn Asn Leu Val Gly Leu
240 245 250

GAC GGC AAA CCA CTT TTC CCT CGG CCG TAC GGC TAT ATG CCG TTT AAG 432
Asp Gly Lys Pro Leu Phe Pro Arg Pro Tyr Gly Tyr Met Pro Phe Lys
255 260 265

ATG CGG AAA TTT CTG CTC GGC GCG ACG GTT GCG CAT CAG GCG ACA TTC 480
Met Arg Lys Phe Leu Leu Gly Ala Thr Val Ala His Gln Ala Thr Phe
270 275 280 285

TTC GGC GCG TCG CTG GTA GCC AAG TTG GGC GGT TAC GAT CTT GAT TTT 528
Phe Gly Ala Ser Leu Val Ala Lys Leu Gly Gly Tyr Asp Leu Asp Phe
290 295 300

GGA CTC GAG GCG GAC CAG CTG TTC ATC TAC CGT GCC GCA CTA ATA CGG 576
Gly Leu Glu Ala Asp Gln Leu Phe Ile Tyr Arg Ala Ala Leu Ile Arg
305 310 315

CCT CCC GTC ACG ATC GAC CGC GTG GTT TGC GAC TTC GAT GTC ACG GGA 624
Pro Pro Val Thr Ile Asp Arg Val Val Cys Asp Phe Asp Val Thr Gly
320 325 330

CCT GGT TCA ACC CAG CCC ATC CGT GAG CAC TAT CGG ACC CTG CGG CGG 672
Pro Gly Ser Thr Gln Pro Ile Arg Glu His Tyr Arg Thr Leu Arg Arg
335 340 345

CTC TGG GAC CTG CAT GGC GAC TAC CCG CTG GGT GGG CGC AGA GTG TCG 720
Leu Trp Asp Leu His Gly Asp Tyr Pro Leu Gly Gly Arg Arg Val Ser
350 355 360 365

TGG GCT TAC TTG CGT GTG AAG GAG TAC TTG ATT CGG GCC GAC CTG GCC 768
Trp Ala Tyr Leu Arg Val Lys Glu Tyr Leu Ile Arg Ala Asp Leu Ala
370 375 380

GCA TTC AAC GCG GTA AAG TTC TTG CGA GCG AAG TTC GCC AGA GCT TCG 816
Ala Phe Asn Ala Val Lys Phe Leu Arg Ala Lys Phe Ala Arg Ala Ser
385 390 395

CGG AAG CAA AAT TCA TAG 834
Arg Lys Gln Asn Ser
400 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Ser Ser Ala Pro Thr Val Ser Val Ile Thr Ile Ser Leu Asn Asp
1 5 10 15

Leu Glu Gly Leu Lys Ser Thr Val Glu Ser Val Arg Ala Gln Arg Tyr
20 25 30

Gly Gly Arg Ile Glu His Ile Val Ile Asp Gly Gly Ser Gly Asp Ala
35 40 45

Val Val Glu Tyr Leu Ser Gly Asp Pro Gly Phe Ala Tyr Trp Gln Ser
50 55 60

Gln Pro Asp Asn Gly Arg Tyr Asp Ala Met Asn Gln Gly Ile Ala His
65 70 75 80

Ser Ser Gly Asp Leu Leu Trp Phe Met His Ser Thr Asp Arg Phe Ser
85 90 95

Asp Pro Asp Ala Val Ala Ser Val Val Glu Ala Leu Ser Gly His Gly
100 105 110

Pro Val Arg Asp Leu Trp Gly Tyr Gly Lys Asn Asn Leu Val Gly Leu
115 120 125

Asp Gly Lys Pro Leu Phe Pro Arg Pro Tyr Gly Tyr Met Pro Phe Lys
130 135 140

Met Arg Lys Phe Leu Leu Gly Ala Thr Val Ala His Gln Ala Thr Phe
145 150 155 160

Phe Gly Ala Ser Leu Val Ala Lys Leu Gly Gly Tyr Asp Leu Asp Phe
165 170 175

Gly Leu Glu Ala Asp Gln Leu Phe Ile Tyr Arg Ala Ala Leu Ile Arg
180 185 190

Pro Pro Val Thr Ile Asp Arg Val Val Cys Asp Phe Asp Val Thr Gly
195 200 205

Pro Gly Ser Thr Gln Pro Ile Arg Glu His Tyr Arg Thr Leu Arg Arg
210 215 220

Leu Trp Asp Leu His Gly Asp Tyr Pro Leu Gly Gly Arg Arg Val Ser
225 230 235 240

Trp Ala Tyr Leu Arg Val Lys Glu Tyr Leu Ile Arg Ala Asp Leu Ala
245 250 255

Ala Phe Asn Ala Val Lys Phe Leu Arg Ala Lys Phe Ala Arg Ala Ser
260 265 270

Arg Lys Gln Asn Ser
275

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1029



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTG AAG CGA GCG CTT ATA ACA GGG ATC ACG GGG CAG GAT GGT TCC TAC 48
Val Lys Arg Ala Leu Ile Thr Gly Ile Thr Gly Gln Asp Gly Ser Tyr
280 285 290

CTC GCC GAG CTA CTA CTG AGC AAG GGA TAC GAG GTT CAC GGG CTC GTT 96
Leu Ala Glu Leu Leu Leu Ser Lys Gly Tyr Glu Val His Gly Leu Val
295 300 305

CGT CGA GCT TCG ACG TTT AAC ACG TCG CGG ATC GAT CAC CTC TAC GTT 144
Arg Arg Ala Ser Thr Phe Asn Thr Ser Arg Ile Asp His Leu Tyr Val
310 315 320 325

GAC CCA CAC CAA CCG GGC GCG CGC TTG TTC TTG CAC TAT GCA GAC CTC 192
Asp Pro His Gln Pro Gly Ala Arg Leu Phe Leu His Tyr Ala Asp Leu
330 335 340

ACT GAC GGC ACC CGG TTG GTG ACC CTG CTC AGC AGT ATC GAC CCG GAT 240
Thr Asp Gly Thr Arg Leu Val Thr Leu Leu Ser Ser Ile Asp Pro Asp
345 350 355

GAG GTC TAC AAC CTC GCA GCG CAG TCC CAT GTG CGC GTC AGC TTT GAC 288
Glu Val Tyr Asn Leu Ala Ala Gln Ser His Val Arg Val Ser Phe Asp
360 365 370

GAG CCA GTG CAT ACC GGA GAC ACC ACC GGC ATG GGA TCG ATC CGA CTT 336
Glu Pro Val His Thr Gly Asp Thr Thr Gly Met Gly Ser Ile Arg Leu
375 380 385

CTG GAA GCA GTC CGC CTT TCT CGG GTG GAC TGC CGG TTC TAT CAG GCT 384
Leu Glu Ala Val Arg Leu Ser Arg Val Asp Cys Arg Phe Tyr Gln Ala
390 395 400 405

TCC TCG TCG GAG ATG TTC GGC GCA TCT CCG CCA CCG CAG AAC GAA TCG 432
Ser Ser Ser Glu Met Phe Gly Ala Ser Pro Pro Pro Gln Asn Glu Ser
410 415 420

ACG CCG TTC TAT CCC CGT TCG CCA TAC GGC GCG GCC AAG GTC TTC TCG 480
Thr Pro Phe Tyr Pro Arg Ser Pro Tyr Gly Ala Ala Lys Val Phe Ser
425 430 435

TAC TGG ACG ACT CGC AAC TAT CGA GAG GCG TAC GGA TTA TTC GCA GTG 528
Tyr Trp Thr Thr Arg Asn Tyr Arg Glu Ala Tyr Gly Leu Phe Ala Val
440 445 450

AAT GGC ATC TTG TTC AAC CAT GAG TCC CCC CGG CGC GGC GAG ACT TTC 576
Asn Gly Ile Leu Phe Asn His Glu Ser Pro Arg Arg Gly Glu Thr Phe
455 460 465

GTG ACC CGA AAG ATC ACG CGT GCC GTG GCG CGC ATC CGA GCT GGC GTC 624
Val Thr Arg Lys Ile Thr Arg Ala Val Ala Arg Ile Arg Ala Gly Val
470 475 480 485

CAA TCG GAG GTC TAT ATG GGC AAC CTC GAT GCG ATC CGC GAC TGG GGC 672
Gln Ser Glu Val Tyr Met Gly Asn Leu Asp Ala Ile Arg Asp Trp Gly
490 495 500

TAC GCG CCC GAA TAT GTC GAG GGG ATG TGG AGG ATG TTG CAA GCG CCT 720
Tyr Ala Pro Glu Tyr Val Glu Gly Met Trp Arg Met Leu Gln Ala Pro
505 510 515

GAA CCT GAT GAC TAC GTC CTG GCG ACA GGG CGT GGT TAC ACC GTA CGT 768
Glu Pro Asp Asp Tyr Val Leu Ala Thr Gly Arg Gly Tyr Thr Val Arg
520 525 530

GAG TTC GCT CAA GCT GCT TTT GAC CAT GTC GGG CTC GAC TGG CAA AAG 816
Glu Phe Ala Gln Ala Ala Phe Asp His Val Gly Leu Asp Trp Gln Lys
535 540 545

CGC GTC AAG TTT GAC GAC CGC TAT TTG CGT CCC ACC GAG GTC GAT TCG 864
Arg Val Lys Phe Asp Asp Arg Tyr Leu Arg Pro Thr Glu Val Asp Ser
550 555 560 565

CTA GTA GGA GAT GCC GAC AAG GCG GCC CAG TCA CTC GGC TGG AAA GCT 912
Leu Val Gly Asp Ala Asp Lys Ala Ala Gln Ser Leu Gly Trp Lys Ala
570 575 580

TCG GTT CAT ACT GGT GAA CTC GCG CGC ATC ATG GTG GAC GCG GAC ATC 960
Ser Val His Thr Gly Glu Leu Ala Arg Ile Met Val Asp Ala Asp Ile
585 590 595

GCC GCG TTG GAG TGC GAT GGC ACA CCA TGG ATC GAC ACG CCG ATG TTG 1008
Ala Ala Leu Glu Cys Asp Gly Thr Pro Trp Ile Asp Thr Pro Met Leu
600 605 610

CCT GGT TGG GGC AGA GTA AGT TGA 1032
Pro Gly Trp Gly Arg Val Ser
615 620



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Val Lys Arg Ala Leu Ile Thr Gly Ile Thr Gly Gln Asp Gly Ser Tyr
1 5 10 15

Leu Ala Glu Leu Leu Leu Ser Lys Gly Tyr Glu Val His Gly Leu Val
20 25 30

Arg Arg Ala Ser Thr Phe Asn Thr Ser Arg Ile Asp His Leu Tyr Val
35 40 45

Asp Pro His Gln Pro Gly Ala Arg Leu Phe Leu His Tyr Ala Asp Leu
50 55 60

Thr Asp Gly Thr Arg Leu Val Thr Leu Leu Ser Ser Ile Asp Pro Asp
65 70 75 80

Glu Val Tyr Asn Leu Ala Ala Gln Ser His Val Arg Val Ser Phe Asp
85 90 95

Glu Pro Val His Thr Gly Asp Thr Thr Gly Met Gly Ser Ile Arg Leu
100 105 110

Leu Glu Ala Val Arg Leu Ser Arg Val Asp Cys Arg Phe Tyr Gln Ala
115 120 125

Ser Ser Ser Glu Met Phe Gly Ala Ser Pro Pro Pro Gln Asn Glu Ser
130 135 140

Thr Pro Phe Tyr Pro Arg Ser Pro Tyr Gly Ala Ala Lys Val Phe Ser
145 150 155 160

Tyr Trp Thr Thr Arg Asn Tyr Arg Glu Ala Tyr Gly Leu Phe Ala Val
165 170 175

Asn Gly Ile Leu Phe Asn His Glu Ser Pro Arg Arg Gly Glu Thr Phe
180 185 190

Val Thr Arg Lys Ile Thr Arg Ala Val Ala Arg Ile Arg Ala Gly Val
195 200 205

Gln Ser Glu Val Tyr Met Gly Asn Leu Asp Ala Ile Arg Asp Trp Gly
210 215 220

Tyr Ala Pro Glu Tyr Val Glu Gly Met Trp Arg Met Leu Gln Ala Pro
225 230 235 240

Glu Pro Asp Asp Tyr Val Leu Ala Thr Gly Arg Gly Tyr Thr Val Arg
245 250 255

Glu Phe Ala Gln Ala Ala Phe Asp His Val Gly Leu Asp Trp Gln Lys
260 265 270

Arg Val Lys Phe Asp Asp Arg Tyr Leu Arg Pro Thr Glu Val Asp Ser
275 280 285

Leu Val Gly Asp Ala Asp Lys Ala Ala Gln Ser Leu Gly Trp Lys Ala
290 295 300

Ser Val His Thr Gly Glu Leu Ala Arg Ile Met Val Asp Ala Asp Ile
305 310 315 320

Ala Ala Leu Glu Cys Asp Gly Thr Pro Trp Ile Asp Thr Pro Met Leu
325 330 335

Pro Gly Trp Gly Arg Val Ser

(2) INFORMATION FOR SEQ ID NO: 11:



(i) SEQUENCE CHARACTERISTICS: 





(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1029



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTG AAG CGA GCG CTT ATA ACA GGG ATC ACG GGG CAG GAT GGT TCC TAC 48
Val Lys Arg Ala Leu Ile Thr Gly Ile Thr Gly Gln Asp Gly Ser Tyr
345 350 355

CTC GCC GAG CTA CTA CTG AGC AAG GGA TAC GAG GTT CAC GGG CTC GTT 96
Leu Ala Glu Leu Leu Leu Ser Lys Gly Tyr Glu Val His Gly Leu Val
360 365 370 375

CGT CGA GCT TCG ACG TTT AAC ACG TCG CGG ATC GAT CAC CTC TAC GTT 144
Arg Arg Ala Ser Thr Phe Asn Thr Ser Arg Ile Asp His Leu Tyr Val
380 385 390

GAC CCA CAC CAA CCG GGC GCG CGC TTG TTC TTG CAC TAT GCA GAC CTC 192
Asp Pro His Gln Pro Gly Ala Arg Leu Phe Leu His Tyr Ala Asp Leu
395 400 405

ACT GAC GGC ACC CGG TTG GTG ACC CTG CTC AGC AGT ATC GAC CCG GAT 240
Thr Asp Gly Thr Arg Leu Val Thr Leu Leu Ser Ser Ile Asp Pro Asp
410 415 420

GAG GTC TAC AAC CTC GCA GCG CAG TCC CAT GTG CGC GTC AGC TTT GAC 288
Glu Val Tyr Asn Leu Ala Ala Gln Ser His Val Arg Val Ser Phe Asp
425 430 435

GAG CCA GTG CAT ACC GGA GAC ACC ACC GGC ATG GGA TCG ATC CGA CTT 336
Glu Pro Val His Thr Gly Asp Thr Thr Gly Met Gly Ser Ile Arg Leu
440 445 450 455

CTG GAA GCA GTC CGC CTT TCT CGG GTG GAC TGC CGG TTC TAT CAG GCT 384
Leu Glu Ala Val Arg Leu Ser Arg Val Asp Cys Arg Phe Tyr Gln Ala
460 465 470

TCC TCG TCG GAG ATG TTC GGC GCA TCT CCG CCA CCG CAG AAC GAA TCG 432
Ser Ser Ser Glu Met Phe Gly Ala Ser Pro Pro Pro Gln Asn Glu Ser
475 480 485

ACG CCG TTC TAT CCC CGT TCG CCA TAC GGC GCG GCC AAG GTC TTC TCG 480
Thr Pro Phe Tyr Pro Arg Ser Pro Tyr Gly Ala Ala Lys Val Phe Ser
490 495 500

TAC TGG ACG ACT CGC AAC TAT CGA GAG GCG TAC GGA TTA TTC GCA GTG 528
Tyr Trp Thr Thr Arg Asn Tyr Arg Glu Ala Tyr Gly Leu Phe Ala Val
505 510 515

AAT GGC ATC TTG TTC AAC CAT GAG TCC CCC CGG CGC GGC GAG ACT TTC 576
Asn Gly Ile Leu Phe Asn His Glu Ser Pro Arg Arg Gly Glu Thr Phe
520 525 530 535

GTG ACC CGA AAG ATC ACG CGT GCC GTG GCG CGC ATC CGA GCT GGC GTC 624
Val Thr Arg Lys Ile Thr Arg Ala Val Ala Arg Ile Arg Ala Gly Val
540 545 550

CAA TCG GAG GTC TAT ATG GGC AAC CTC GAT GCG ATC CGC GAC TGG GGC 672
Gln Ser Glu Val Tyr Met Gly Asn Leu Asp Ala Ile Arg Asp Trp Gly
555 560 565

TAC GCG CCC GAA TAT GTC GAG GGG ATG TGG AGG ATG TTG CAA GCG CCT 720
Tyr Ala Pro Glu Tyr Val Glu Gly Met Trp Arg Met Leu Gln Ala Pro
570 575 580

GAA CCT GAT GAC TAC GTC CTG GCG ACA GGG CGT GGT TAC ACC GTA CGT 768
Glu Pro Asp Asp Tyr Val Leu Ala Thr Gly Arg Gly Tyr Thr Val Arg
585 590 595

GAG TTC GCT CAA GCT GCT TTT GAC CAC GTC GGG CTC GAC TGG CAA AAG 816 
Glu Phe Ala Gin Ala Ala Phe Asp His Val Gly Leu Asp Trp Gin Lys 
600 605 610 615 

CAC GTC AAG TTT GAC GAC CGC TAT TTG CGC CCC ACC GAG GTC GAT TCG 864 
His Val Lys Phe Asp Asp Arg Tyr Leu Arg Pro Thr Glu Val Asp Ser 
620 625 630 

CTA GTA GGA GAT GCC GAC AGG GCG GCC CAG TCA CTC GGC TGG AAA GCT 912 
Leu Val Gly Asp Ala Asp Arg Ala Ala Gin Ser Leu Gly Trp Lys Ala 
635 640 645 

TCG GTT CAT ACT GGT GAA CTC GCG CGC ATC ATG GTG GAC GCG GAC ATC 960 
Ser Val His Thr Gly Glu Leu Ala Arg He Met Val Asp Ala Asp He 
650 655 660 

GCC GCG TCG GAG TGC GAT GGC ACA CCA TGG ATC GAC ACG CCG ATG TTG 1008 



Ala Ala Ser Glu Cys Asp Gly Thr Pro Trp lie Asp Thr Pro Met Leu 
665 670 675 



CCT GGT TGG GGC GGA GTA AGT TGA 
Pro Gly Trp Gly Gly Val Ser 
680 685 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Val Lys Arg Ala Leu Ile Thr Gly Ile Thr Gly Gln Asp Gly Ser Tyr
1 5 10 15

Leu Ala Glu Leu Leu Leu Ser Lys Gly Tyr Glu Val His Gly Leu Val
20 25 30

Arg Arg Ala Ser Thr Phe Asn Thr Ser Arg Ile Asp His Leu Tyr Val
35 40 45

Asp Pro His Gln Pro Gly Ala Arg Leu Phe Leu His Tyr Ala Asp Leu
50 55 60

Thr Asp Gly Thr Arg Leu Val Thr Leu Leu Ser Ser Ile Asp Pro Asp
65 70 75 80

Glu Val Tyr Asn Leu Ala Ala Gln Ser His Val Arg Val Ser Phe Asp
85 90 95

Glu Pro Val His Thr Gly Asp Thr Thr Gly Met Gly Ser Ile Arg Leu
100 105 110

Leu Glu Ala Val Arg Leu Ser Arg Val Asp Cys Arg Phe Tyr Gln Ala
115 120 125

Ser Ser Ser Glu Met Phe Gly Ala Ser Pro Pro Pro Gln Asn Glu Ser
130 135 140

Thr Pro Phe Tyr Pro Arg Ser Pro Tyr Gly Ala Ala Lys Val Phe Ser
145 150 155 160

Tyr Trp Thr Thr Arg Asn Tyr Arg Glu Ala Tyr Gly Leu Phe Ala Val
165 170 175

Asn Gly Ile Leu Phe Asn His Glu Ser Pro Arg Arg Gly Glu Thr Phe
180 185 190

Val Thr Arg Lys Ile Thr Arg Ala Val Ala Arg Ile Arg Ala Gly Val
195 200 205

Gln Ser Glu Val Tyr Met Gly Asn Leu Asp Ala Ile Arg Asp Trp Gly
210 215 220

Tyr Ala Pro Glu Tyr Val Glu Gly Met Trp Arg Met Leu Gln Ala Pro
225 230 235 240

Glu Pro Asp Asp Tyr Val Leu Ala Thr Gly Arg Gly Tyr Thr Val Arg
245 250 255

Glu Phe Ala Gln Ala Ala Phe Asp His Val Gly Leu Asp Trp Gln Lys
260 265 270

His Val Lys Phe Asp Asp Arg Tyr Leu Arg Pro Thr Glu Val Asp Ser
275 280 285

Leu Val Gly Asp Ala Asp Arg Ala Ala Gln Ser Leu Gly Trp Lys Ala
290 295 300

Ser Val His Thr Gly Glu Leu Ala Arg Ile Met Val Asp Ala Asp Ile
305 310 315 320

Ala Ala Ser Glu Cys Asp Gly Thr Pro Trp Ile Asp Thr Pro Met Leu
325 330 335

Pro Gly Trp Gly Gly Val Ser
340

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1020

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GTG CGA TGG CAC ACC ATG GAT CGA CAC GCC GAT GTT GCC TGG TTG GGG 48
Val Arg Trp His Thr Met Asp Arg His Ala Asp Val Ala Trp Leu Gly
345 350 355

CAG AGT AAG TTG ACG ACT ACA CCT GGG CCT CTG GAC CGC GCA ACG CCC 96
Gln Ser Lys Leu Thr Thr Thr Pro Gly Pro Leu Asp Arg Ala Thr Pro
360 365 370 375

GTG TAT ATC GCC GGT CAT CGG GGG CTG GTC GGC TCA GCG CTC GTA CGT 144
Val Tyr Ile Ala Gly His Arg Gly Leu Val Gly Ser Ala Leu Val Arg
380 385 390

AGA TTT GAG GCC GAG GGG TTC ACC AAT CTC ATT GTG CGA TCA CGC GAT 192
Arg Phe Glu Ala Glu Gly Phe Thr Asn Leu Ile Val Arg Ser Arg Asp
395 400 405

GAG ATT GAT CTG ACG GAC CGA GCC GCA ACG TTT GAT TTT GTG TCT GAG 240
Glu Ile Asp Leu Thr Asp Arg Ala Ala Thr Phe Asp Phe Val Ser Glu
410 415 420

ACA AGA CCA CAG GTG ATC ATC GAT GCG GCC GCA CGG GTC GGC GGC ATC 288
Thr Arg Pro Gln Val Ile Ile Asp Ala Ala Ala Arg Val Gly Gly Ile
425 430 435

ATG GCG AAT AAC ACC TAT CCC GCG GAC TTC TTG TCC GAA AAC CTC CGA 336
Met Ala Asn Asn Thr Tyr Pro Ala Asp Phe Leu Ser Glu Asn Leu Arg
440 445 450 455

ATC CAG ACC AAT TTG CTC GAC GCA GCT GTC GCC GTG CGT GTG CCG CGG 384
Ile Gln Thr Asn Leu Leu Asp Ala Ala Val Ala Val Arg Val Pro Arg
460 465 470

CTC CTT TTC CTC GGT TCG TCA TGC ATC TAC CCG AAG TAC GCT CCG CAA 432
Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro Lys Tyr Ala Pro Gln
475 480 485

CCT ATC CAC GAG AGT GCT TTA TTG ACT GGC CCT TTG GAG CCC ACC AAC 480
Pro Ile His Glu Ser Ala Leu Leu Thr Gly Pro Leu Glu Pro Thr Asn
490 495 500

GAC GCG TAT GCG ATC GCC AAG ATC GCC GGT ATC CTG CAA GTT CAG GCG 528
Asp Ala Tyr Ala Ile Ala Lys Ile Ala Gly Ile Leu Gln Val Gln Ala
505 510 515

GTT AGG CGC CAA TAT GGG CTG GCG TGG ATC TCT GCG ATG CCG ACT AAC 576
Val Arg Arg Gln Tyr Gly Leu Ala Trp Ile Ser Ala Met Pro Thr Asn
520 525 530 535

CTC TAC GGA CCC GGC GAC AAC TTC TCC CCG TCC GGG TCG CAT CTC TTG 624
Leu Tyr Gly Pro Gly Asp Asn Phe Ser Pro Ser Gly Ser His Leu Leu
540 545 550

CCG GCG CTC ATC CGT CGA TAT GAG GAA GCC AAA GCT GGT GGT GCA GAA 672
Pro Ala Leu Ile Arg Arg Tyr Glu Glu Ala Lys Ala Gly Gly Ala Glu
555 560 565

GAG GTG ACG AAT TGG GGG ACC GGT ACT CCG CGG CGC GAA CTT CTG CAT 720
Glu Val Thr Asn Trp Gly Thr Gly Thr Pro Arg Arg Glu Leu Leu His
570 575 580

GTC GAC GAT CTG GCG AGC GCA TGC CTG TTC CTT TTG GAA CAT TTC GAT 768
Val Asp Asp Leu Ala Ser Ala Cys Leu Phe Leu Leu Glu His Phe Asp
585 590 595

GGT CCG AAC CAC GTC AAC GTG GGC ACC GGC GTC GAT CAC AGC ATT AGC 816
Gly Pro Asn His Val Asn Val Gly Thr Gly Val Asp His Ser Ile Ser
600 605 610 615

GAG ATC GCA GAC ATG GTC GCT ACA GCG GTG GGC TAC ATC GGC GAA ACA 864
Glu Ile Ala Asp Met Val Ala Thr Ala Val Gly Tyr Ile Gly Glu Thr
620 625 630

CGT TGG GAT CCA ACT AAA CCC GAT GGA ACC CCG CGC AAA CTA TTG GAC 912
Arg Trp Asp Pro Thr Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp
635 640 645

GTC TCC GCG CTA CGC GAG TTG GGT TGG CGC CCG CGA ATC GCA CTG AAA 960
Val Ser Ala Leu Arg Glu Leu Gly Trp Arg Pro Arg Ile Ala Leu Lys
650 655 660

GAC GGC ATC GAT GCA ACG GTG TCG TGG TAC CGC ACA AAT GCC GAT GCC 1008
Asp Gly Ile Asp Ala Thr Val Ser Trp Tyr Arg Thr Asn Ala Asp Ala
665 670 675

GTG AGG AGG TAA 1020
Val Arg Arg *
680




(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Val Arg Trp His Thr Met Asp Arg His Ala Asp Val Ala Trp Leu Gly
1 5 10 15

Gln Ser Lys Leu Thr Thr Thr Pro Gly Pro Leu Asp Arg Ala Thr Pro
20 25 30

Val Tyr Ile Ala Gly His Arg Gly Leu Val Gly Ser Ala Leu Val Arg
35 40 45

Arg Phe Glu Ala Glu Gly Phe Thr Asn Leu Ile Val Arg Ser Arg Asp
50 55 60

Glu Ile Asp Leu Thr Asp Arg Ala Ala Thr Phe Asp Phe Val Ser Glu
65 70 75 80

Thr Arg Pro Gln Val Ile Ile Asp Ala Ala Ala Arg Val Gly Gly Ile
85 90 95

Met Ala Asn Asn Thr Tyr Pro Ala Asp Phe Leu Ser Glu Asn Leu Arg
100 105 110

Ile Gln Thr Asn Leu Leu Asp Ala Ala Val Ala Val Arg Val Pro Arg
115 120 125

Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro Lys Tyr Ala Pro Gln
130 135 140

Pro Ile His Glu Ser Ala Leu Leu Thr Gly Pro Leu Glu Pro Thr Asn
145 150 155 160

Asp Ala Tyr Ala Ile Ala Lys Ile Ala Gly Ile Leu Gln Val Gln Ala
165 170 175

Val Arg Arg Gln Tyr Gly Leu Ala Trp Ile Ser Ala Met Pro Thr Asn
180 185 190

Leu Tyr Gly Pro Gly Asp Asn Phe Ser Pro Ser Gly Ser His Leu Leu
195 200 205

Pro Ala Leu Ile Arg Arg Tyr Glu Glu Ala Lys Ala Gly Gly Ala Glu
210 215 220

Glu Val Thr Asn Trp Gly Thr Gly Thr Pro Arg Arg Glu Leu Leu His
225 230 235 240

Val Asp Asp Leu Ala Ser Ala Cys Leu Phe Leu Leu Glu His Phe Asp
245 250 255

Gly Pro Asn His Val Asn Val Gly Thr Gly Val Asp His Ser Ile Ser
260 265 270

Glu Ile Ala Asp Met Val Ala Thr Ala Val Gly Tyr Ile Gly Glu Thr
275 280 285

Arg Trp Asp Pro Thr Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp
290 295 300

Val Ser Ala Leu Arg Glu Leu Gly Trp Arg Pro Arg Ile Ala Leu Lys
305 310 315 320

Asp Gly Ile Asp Ala Thr Val Ser Trp Tyr Arg Thr Asn Ala Asp Ala
325 330 335

Val Arg Arg *
340

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1020



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GTG CGA TGG CAC ACC ATG GAT CGA CAC GCC GAT GTT GCC TGG TTG GGG 48
Val Arg Trp His Thr Met Asp Arg His Ala Asp Val Ala Trp Leu Gly
345 350 355

CGG AGT AAG TTG ACG ACT ACA CCT GGG CCT CTG GAC CGC GCA ACG CCC 96
Arg Ser Lys Leu Thr Thr Thr Pro Gly Pro Leu Asp Arg Ala Thr Pro
360 365 370

GTG TAT ATC GCC GGT CAT CGG GGG CTG GTC GGC TCA GCG CTC GTA CGT 144
Val Tyr Ile Ala Gly His Arg Gly Leu Val Gly Ser Ala Leu Val Arg
375 380 385

AGA TTT GAG GCC GAG GGG TTC ACC AAT CTC ATT GTG CGA TCA CGC GAT 192
Arg Phe Glu Ala Glu Gly Phe Thr Asn Leu Ile Val Arg Ser Arg Asp
390 395 400

GAG ATT GAT CTG ACG GAC CGA GCC GCA ACG TTT GAT TTT GTG TCT GAG 240
Glu Ile Asp Leu Thr Asp Arg Ala Ala Thr Phe Asp Phe Val Ser Glu
405 410 415 420

ACA AGA CCA CAG GTG ATC ATC GAT GCG GCC GCA CGG GTC GGC GGC ATC 288
Thr Arg Pro Gln Val Ile Ile Asp Ala Ala Ala Arg Val Gly Gly Ile
425 430 435

ATG GCG AAT AAC ACC TAT CCC GCG GAC TTC TTG TCC GAA AAC CTC CGA 336
Met Ala Asn Asn Thr Tyr Pro Ala Asp Phe Leu Ser Glu Asn Leu Arg
440 445 450

ATC CAG ACC AAT TTG CTC GAC GCA GCT GTC GCC GTG CGT GTG CCG CGG 384
Ile Gln Thr Asn Leu Leu Asp Ala Ala Val Ala Val Arg Val Pro Arg
455 460 465

CTC CTT TTC CTC GGT TCG TCA TGC ATC TAC CCG AAG TAC GCT CCG CAA 432
Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro Lys Tyr Ala Pro Gln
470 475 480

CCT ATC CAC GAG AGT GCT TTA TTG ACT GGC CCT TTG GAG CCC ACC AAC 480
Pro Ile His Glu Ser Ala Leu Leu Thr Gly Pro Leu Glu Pro Thr Asn
485 490 495 500

GAC GCG TAT GCG ATC GCC AAG ATC GCC GGT ATC CTG CAA GTT CAG GCG 528
Asp Ala Tyr Ala Ile Ala Lys Ile Ala Gly Ile Leu Gln Val Gln Ala
505 510 515

GTT AGG CGC CAA TAT GGG CTG GCG TGG ATC TCT GCG ATG CCG ACT AAC 576
Val Arg Arg Gln Tyr Gly Leu Ala Trp Ile Ser Ala Met Pro Thr Asn
520 525 530

CTC TAC GGA CCC GGC GAC AAC TTC TCC CCG TCC GGG TCG CAT CTC TTG 624
Leu Tyr Gly Pro Gly Asp Asn Phe Ser Pro Ser Gly Ser His Leu Leu
535 540 545

CCG GCG CTC ATC CGT CGA TAT GAG GAA GCC AAA GCT GGT GGT GCA GAA 672
Pro Ala Leu Ile Arg Arg Tyr Glu Glu Ala Lys Ala Gly Gly Ala Glu
550 555 560

GAG GTG ACG AAT TGG GGG ACC GGT ACT CCG CGG CGC GAA CTT CTG CAT 720
Glu Val Thr Asn Trp Gly Thr Gly Thr Pro Arg Arg Glu Leu Leu His
565 570 575 580

GTC GAC GAT CTG GCG AGC GCA TGC CTG TTC CTT TTG GAA CAT TTC GAT 768
Val Asp Asp Leu Ala Ser Ala Cys Leu Phe Leu Leu Glu His Phe Asp
585 590 595

GGT CCG AAC CAC GTC AAC GTG GGC ACC GGC GTC GAT CAC AGC ATT AGC 816
Gly Pro Asn His Val Asn Val Gly Thr Gly Val Asp His Ser Ile Ser
600 605 610

GAG ATC GCA GAC ATG GTC GCT ACG GCG GTG GGC TAC ATC GGC GAA ACA 864
Glu Ile Ala Asp Met Val Ala Thr Ala Val Gly Tyr Ile Gly Glu Thr
615 620 625

CGT TGG GAT CCA ACT AAA CCC GAT GGA ACC CCG CGC AAA CTA TTG GAC 912
Arg Trp Asp Pro Thr Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp
630 635 640

GTC TCC GCG CTA CGC GAG TTG GGT TGG CGC CCG CGA ATC GCA CTG AAA 960
Val Ser Ala Leu Arg Glu Leu Gly Trp Arg Pro Arg Ile Ala Leu Lys
645 650 655 660

GAC GGC ATC GAT GCA ACG GTG TCG TGG TAC CGC ACA AAT GCC GAT GCC 1008
Asp Gly Ile Asp Ala Thr Val Ser Trp Tyr Arg Thr Asn Ala Asp Ala
665 670 675

GTG AGG AGG TAA 1020
Val Arg Arg *
680



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 





Val Arg Trp His Thr Met Asp Arg His Ala Asp Val Ala Trp Leu Gly
1 5 10 15

Arg Ser Lys Leu Thr Thr Thr Pro Gly Pro Leu Asp Arg Ala Thr Pro
20 25 30

Val Tyr Ile Ala Gly His Arg Gly Leu Val Gly Ser Ala Leu Val Arg
35 40 45

Arg Phe Glu Ala Glu Gly Phe Thr Asn Leu Ile Val Arg Ser Arg Asp
50 55 60

Glu Ile Asp Leu Thr Asp Arg Ala Ala Thr Phe Asp Phe Val Ser Glu
65 70 75 80

Thr Arg Pro Gln Val Ile Ile Asp Ala Ala Ala Arg Val Gly Gly Ile
85 90 95

Met Ala Asn Asn Thr Tyr Pro Ala Asp Phe Leu Ser Glu Asn Leu Arg
100 105 110

Ile Gln Thr Asn Leu Leu Asp Ala Ala Val Ala Val Arg Val Pro Arg
115 120 125

Leu Leu Phe Leu Gly Ser Ser Cys Ile Tyr Pro Lys Tyr Ala Pro Gln
130 135 140

Pro Ile His Glu Ser Ala Leu Leu Thr Gly Pro Leu Glu Pro Thr Asn
145 150 155 160

Asp Ala Tyr Ala Ile Ala Lys Ile Ala Gly Ile Leu Gln Val Gln Ala
165 170 175

Val Arg Arg Gln Tyr Gly Leu Ala Trp Ile Ser Ala Met Pro Thr Asn
180 185 190

Leu Tyr Gly Pro Gly Asp Asn Phe Ser Pro Ser Gly Ser His Leu Leu
195 200 205

Pro Ala Leu Ile Arg Arg Tyr Glu Glu Ala Lys Ala Gly Gly Ala Glu
210 215 220

Glu Val Thr Asn Trp Gly Thr Gly Thr Pro Arg Arg Glu Leu Leu His
225 230 235 240

Val Asp Asp Leu Ala Ser Ala Cys Leu Phe Leu Leu Glu His Phe Asp
245 250 255

Gly Pro Asn His Val Asn Val Gly Thr Gly Val Asp His Ser Ile Ser
260 265 270

Glu Ile Ala Asp Met Val Ala Thr Ala Val Gly Tyr Ile Gly Glu Thr
275 280 285

Arg Trp Asp Pro Thr Lys Pro Asp Gly Thr Pro Arg Lys Leu Leu Asp
290 295 300

Val Ser Ala Leu Arg Glu Leu Gly Trp Arg Pro Arg Ile Ala Leu Lys
305 310 315 320

Asp Gly Ile Asp Ala Thr Val Ser Trp Tyr Arg Thr Asn Ala Asp Ala
325 330 335

Val Arg Arg *
340

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..720



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG GAT TTT TTG CGC AAC GCC GGC TTG ATG GCT CGT AAC GTT AGT ACC 48
Met Asp Phe Leu Arg Asn Ala Gly Leu Met Ala Arg Asn Val Ser Thr
345 350 355

GAG ATG CTG CGC CAC TTC GAA CGA AAG CGC CTA TTA GTA AAC CAA TTC 96
Glu Met Leu Arg His Phe Glu Arg Lys Arg Leu Leu Val Asn Gln Phe
360 365 370

AAA GCA TAC GGA GTC AAC GTT GTT ATT GAT GTC GGT GCT AAC TCC GGC 144
Lys Ala Tyr Gly Val Asn Val Val Ile Asp Val Gly Ala Asn Ser Gly
375 380 385

CAG TTC GGT AGC GCT TTG CGT CGT GCA GGA TTC AAG AGC CGT ATC GTT 192
Gln Phe Gly Ser Ala Leu Arg Arg Ala Gly Phe Lys Ser Arg Ile Val
390 395 400

TCC TTT GAA CCT CTT TCG GGG CCA TTT GCG CAA CTA ACG CGC AAG TCG 240
Ser Phe Glu Pro Leu Ser Gly Pro Phe Ala Gln Leu Thr Arg Lys Ser
405 410 415 420

GCA TCG GAT CCA CTA TGG GAG TGT CAC CAG TAT GCC CTA GGC GAC GCC 288
Ala Ser Asp Pro Leu Trp Glu Cys His Gln Tyr Ala Leu Gly Asp Ala
425 430 435

GAT GAG ACG ATT ACC ATC AAT GTG GCA GGC AAT GCG GGG GCA AGT AGT 336
Asp Glu Thr Ile Thr Ile Asn Val Ala Gly Asn Ala Gly Ala Ser Ser
440 445 450

TCC GTG CTG CCG ATG CTT AAA AGT CAT CAA GAT GCC TTT CCT CCC GCG 384
Ser Val Leu Pro Met Leu Lys Ser His Gln Asp Ala Phe Pro Pro Ala
455 460 465

AAT TAT ATT GGC ACC GAA GAC GTT GCA ATA CAC CGC CTT GAT TCG GTT 432
Asn Tyr Ile Gly Thr Glu Asp Val Ala Ile His Arg Leu Asp Ser Val
470 475 480

GCA TCA GAA TTT CTG AAC CCT ACC GAT GTT ACT TTC CTG AAG ATC GAC 480
Ala Ser Glu Phe Leu Asn Pro Thr Asp Val Thr Phe Leu Lys Ile Asp
485 490 495 500

GTA CAG GGT TTC GAG AAG CAG GTT ATC ACG GGC AGT AAG TCA ACG CTT 528
Val Gln Gly Phe Glu Lys Gln Val Ile Thr Gly Ser Lys Ser Thr Leu
505 510 515

AAC GAA AGC TGC GTC GGC ATG CAA CTC GAA CTT TCT TTT ATT CCG TTG 576
Asn Glu Ser Cys Val Gly Met Gln Leu Glu Leu Ser Phe Ile Pro Leu
520 525 530

TAC GAA GGT GAC ATG CTG ATT CAT GAA GCG CTT GAA CTT GTC TAT TCC 624
Tyr Glu Gly Asp Met Leu Ile His Glu Ala Leu Glu Leu Val Tyr Ser
535 540 545

CTA GGT TTC AGA CTG ACG GGT TTG TTG CCC GGC TTT ACG GAT CCG CGC 672
Leu Gly Phe Arg Leu Thr Gly Leu Leu Pro Gly Phe Thr Asp Pro Arg
550 555 560

AAT GGT CGA ATG CTT CAA GCT GAC GGC ATT TTC TTC CGT GGG GAC GAT 720
Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg Gly Asp Asp
565 570 575 580

TGA 723



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asp Phe Leu Arg Asn Ala Gly Leu Met Ala Arg Asn Val Ser Thr
1 5 10 15

Glu Met Leu Arg His Phe Glu Arg Lys Arg Leu Leu Val Asn Gln Phe
20 25 30

Lys Ala Tyr Gly Val Asn Val Val Ile Asp Val Gly Ala Asn Ser Gly
35 40 45

Gln Phe Gly Ser Ala Leu Arg Arg Ala Gly Phe Lys Ser Arg Ile Val
50 55 60

Ser Phe Glu Pro Leu Ser Gly Pro Phe Ala Gln Leu Thr Arg Lys Ser
65 70 75 80

Ala Ser Asp Pro Leu Trp Glu Cys His Gln Tyr Ala Leu Gly Asp Ala
85 90 95

Asp Glu Thr Ile Thr Ile Asn Val Ala Gly Asn Ala Gly Ala Ser Ser
100 105 110

Ser Val Leu Pro Met Leu Lys Ser His Gln Asp Ala Phe Pro Pro Ala
115 120 125

Asn Tyr Ile Gly Thr Glu Asp Val Ala Ile His Arg Leu Asp Ser Val
130 135 140

Ala Ser Glu Phe Leu Asn Pro Thr Asp Val Thr Phe Leu Lys Ile Asp
145 150 155 160

Val Gln Gly Phe Glu Lys Gln Val Ile Thr Gly Ser Lys Ser Thr Leu
165 170 175

Asn Glu Ser Cys Val Gly Met Gln Leu Glu Leu Ser Phe Ile Pro Leu
180 185 190

Tyr Glu Gly Asp Met Leu Ile His Glu Ala Leu Glu Leu Val Tyr Ser
195 200 205

Leu Gly Phe Arg Leu Thr Gly Leu Leu Pro Gly Phe Thr Asp Pro Arg
210 215 220

Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg Gly Asp Asp
225 230 235 240



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..720



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATG GAT TTT TTG CGC AAC GCC GGC TTG ATG GCT CGT AAC GTT AGC ACC 48
Met Asp Phe Leu Arg Asn Ala Gly Leu Met Ala Arg Asn Val Ser Thr
245 250 255

GAG ATG CTG CGC CAC TTC GAA CGA AAG CGC CTA TTA GTA AAC CAA TTC 96
Glu Met Leu Arg His Phe Glu Arg Lys Arg Leu Leu Val Asn Gln Phe
260 265 270

AAA GCA TAC GGA GTC AAC GTT GTT ATT GAT GTC GGT GCT AAC TCC GGC 144
Lys Ala Tyr Gly Val Asn Val Val Ile Asp Val Gly Ala Asn Ser Gly
275 280 285

CAG TTC GGT AGC GCT TTG CGT CGT GCA GGA TTC AAG AGC CGT ATC GTT 192
Gln Phe Gly Ser Ala Leu Arg Arg Ala Gly Phe Lys Ser Arg Ile Val
290 295 300

TCC TTT GAA CCT CTT TCG GGG CCA TTT GCG CAA CTA ACG CGC GAG TCG 240
Ser Phe Glu Pro Leu Ser Gly Pro Phe Ala Gln Leu Thr Arg Glu Ser
305 310 315 320

GCA TCG GAT CCA CTA TGG GAG TGT CAC CAG TAT GCC CTA GGC GAC GCC 288
Ala Ser Asp Pro Leu Trp Glu Cys His Gln Tyr Ala Leu Gly Asp Ala
325 330 335

GAT GAG ACG ATT ACC ATC AAT GTG GCA GGC AAT GCG GGG GCA AGT AGT 336
Asp Glu Thr Ile Thr Ile Asn Val Ala Gly Asn Ala Gly Ala Ser Ser
340 345 350

TCC GTG CTG CCG ATG CTT AAA AGT CAT CAA GAT GCC TTT CCT CCC GCG 384
Ser Val Leu Pro Met Leu Lys Ser His Gln Asp Ala Phe Pro Pro Ala
355 360 365

AAT TAT ATT GGC ACC GAA GAC GTT GCA ATA CAC CGC CTT GAT TCG GTT 432
Asn Tyr Ile Gly Thr Glu Asp Val Ala Ile His Arg Leu Asp Ser Val
370 375 380

GCA TCA GAA TTT CTG AAC CCT ACC GAT GTT ACT TTC CTG AAG ATC GAC 480
Ala Ser Glu Phe Leu Asn Pro Thr Asp Val Thr Phe Leu Lys Ile Asp
385 390 395 400

GTA CAG GGT TTC GAG AAG CAG GTT ATC GCG GGC AGT AAG TCA ACG CTT 528
Val Gln Gly Phe Glu Lys Gln Val Ile Ala Gly Ser Lys Ser Thr Leu
405 410 415

AAC GAA AGC TGC GTC GGC ATG CAA CTC GAA CTT TCT TTT ATT CCG TTG 576
Asn Glu Ser Cys Val Gly Met Gln Leu Glu Leu Ser Phe Ile Pro Leu
420 425 430

TAC GAA GGT GAC ATG CTG ATT CAT GAA GCG CTT GAA CTT GTC TAT TCC 624
Tyr Glu Gly Asp Met Leu Ile His Glu Ala Leu Glu Leu Val Tyr Ser
435 440 445

CTA GGT TTC AGA CTG ACG GGT TTG TTG CCC GGA TTT ACG GAT CCG CGC 672
Leu Gly Phe Arg Leu Thr Gly Leu Leu Pro Gly Phe Thr Asp Pro Arg
450 455 460

AAT CGT CGA ATG CTT CAA GCT GAC GGC ATT TTC TTC CGT GGG GAC GAT 720
Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg Gly Asp Asp
465 470 475 480

TGA 723



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 






(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Asp Phe Leu Arg Asn Ala Gly Leu Met Ala Arg Asn Val Ser Thr
1 5 10 15

Glu Met Leu Arg His Phe Glu Arg Lys Arg Leu Leu Val Asn Gln Phe
20 25 30

Lys Ala Tyr Gly Val Asn Val Val Ile Asp Val Gly Ala Asn Ser Gly
35 40 45

Gln Phe Gly Ser Ala Leu Arg Arg Ala Gly Phe Lys Ser Arg Ile Val
50 55 60

Ser Phe Glu Pro Leu Ser Gly Pro Phe Ala Gln Leu Thr Arg Glu Ser
65 70 75 80

Ala Ser Asp Pro Leu Trp Glu Cys His Gln Tyr Ala Leu Gly Asp Ala
85 90 95

Asp Glu Thr Ile Thr Ile Asn Val Ala Gly Asn Ala Gly Ala Ser Ser
100 105 110

Ser Val Leu Pro Met Leu Lys Ser His Gln Asp Ala Phe Pro Pro Ala
115 120 125

Asn Tyr Ile Gly Thr Glu Asp Val Ala Ile His Arg Leu Asp Ser Val
130 135 140

Ala Ser Glu Phe Leu Asn Pro Thr Asp Val Thr Phe Leu Lys Ile Asp
145 150 155 160

Val Gln Gly Phe Glu Lys Gln Val Ile Ala Gly Ser Lys Ser Thr Leu
165 170 175

Asn Glu Ser Cys Val Gly Met Gln Leu Glu Leu Ser Phe Ile Pro Leu
180 185 190

Tyr Glu Gly Asp Met Leu Ile His Glu Ala Leu Glu Leu Val Tyr Ser
195 200 205

Leu Gly Phe Arg Leu Thr Gly Leu Leu Pro Gly Phe Thr Asp Pro Arg
210 215 220

Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg Gly Asp Asp
225 230 235 240



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..798



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG ACT GCG CCA GTG TTC TCG ATA ATT ATC CCT ACC TTC AAT GCA GCG 48
Met Thr Ala Pro Val Phe Ser Ile Ile Ile Pro Thr Phe Asn Ala Ala
245 250 255

GTG ACG CTG CAA GCC TGC CTC GGA AGC ATC GTC GGG CAG ACC TAC CGG 96
Val Thr Leu Gln Ala Cys Leu Gly Ser Ile Val Gly Gln Thr Tyr Arg
260 265 270

GAA GTG GAA GTG GTC CTT GTC GAC GGC GGT TCG ACC GAT CGG ACC CTC 144
Glu Val Glu Val Val Leu Val Asp Gly Gly Ser Thr Asp Arg Thr Leu
275 280 285

GAC ATC GCG AAC AGT TTC CGC CCG GAA CTC GGC TCG CGA CTG GTC GTT 192
Asp Ile Ala Asn Ser Phe Arg Pro Glu Leu Gly Ser Arg Leu Val Val
290 295 300

CAC AGC GGG CCC GAT GAT GGC CCC TAC GAC GCC ATG AAC CGC GGC GTC 240
His Ser Gly Pro Asp Asp Gly Pro Tyr Asp Ala Met Asn Arg Gly Val
305 310 315 320

GGC GTG GCC ACA GGC GAA TGG GTA CTT TTT TTA GGC GCC GAC GAC ACC 288
Gly Val Ala Thr Gly Glu Trp Val Leu Phe Leu Gly Ala Asp Asp Thr
325 330 335

CTC TAC GAA CCA ACC ACG TTG GCC CAG GTA GCC GCT TTT CTC GGC GAC 336
Leu Tyr Glu Pro Thr Thr Leu Ala Gln Val Ala Ala Phe Leu Gly Asp
340 345 350

CAT GCG GCA AGC CAT CTT GTC TAT GGC GAT GTT GTG ATG CGT TCG ACG 384
His Ala Ala Ser His Leu Val Tyr Gly Asp Val Val Met Arg Ser Thr
355 360 365

AAA AGC CGG CAT GCC GGA CCT TTC GAC CTC GAC CGC CTC CTA TTT GAG 432
Lys Ser Arg His Ala Gly Pro Phe Asp Leu Asp Arg Leu Leu Phe Glu
370 375 380

ACG AAT TTG TGC CAC CAA TCG ATC TTT TAC CGC CGT GAG CTT TTC GAC 480
Thr Asn Leu Cys His Gln Ser Ile Phe Tyr Arg Arg Glu Leu Phe Asp
385 390 395 400

GGC ATC GGC CCT TAC AAC CTG CGC TAC CGA GTC TGG GCG GAC TGG GAC 528
Gly Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Trp Ala Asp Trp Asp
405 410 415

TTC AAT ATT CGC TGC TTC TCC AAC CCG GCG CTG ATT ACC CGC TAC ATG 576
Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Ile Thr Arg Tyr Met
420 425 430

GAC GTC GTG ATT TCC GAA TAC AAC GAC ATG ACC GGC TTC AGC ATG AGG 624
Asp Val Val Ile Ser Glu Tyr Asn Asp Met Thr Gly Phe Ser Met Arg
435 440 445

CAG GGG ACT GAT AAA GAG TTC AGA AAA CGG CTG CCA ATG TAC TTC TGG 672
Gln Gly Thr Asp Lys Glu Phe Arg Lys Arg Leu Pro Met Tyr Phe Trp
450 455 460

GTT GCA GGG TGG GAG ACT TGC AGG CGC ATG CTG GCG TTT TTG AAA GAC 720
Val Ala Gly Trp Glu Thr Cys Arg Arg Met Leu Ala Phe Leu Lys Asp
465 470 475 480

AAG GAG AAT CGC CGT CTG GCC TTG CGT ACG CGG TTG ATA AGG GTT AAG 768
Lys Glu Asn Arg Arg Leu Ala Leu Arg Thr Arg Leu Ile Arg Val Lys
485 490 495

GCC GTC TCC AAA GAA CGA AGC GCA GAA CCG TAG 801
Ala Val Ser Lys Glu Arg Ser Ala Glu Pro
500 505



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 266 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Thr Ala Pro Val Phe Ser Ile Ile Ile Pro Thr Phe Asn Ala Ala
1 5 10 15

Val Thr Leu Gln Ala Cys Leu Gly Ser Ile Val Gly Gln Thr Tyr Arg
20 25 30

Glu Val Glu Val Val Leu Val Asp Gly Gly Ser Thr Asp Arg Thr Leu
35 40 45

Asp Ile Ala Asn Ser Phe Arg Pro Glu Leu Gly Ser Arg Leu Val Val
50 55 60

His Ser Gly Pro Asp Asp Gly Pro Tyr Asp Ala Met Asn Arg Gly Val
65 70 75 80

Gly Val Ala Thr Gly Glu Trp Val Leu Phe Leu Gly Ala Asp Asp Thr
85 90 95

Leu Tyr Glu Pro Thr Thr Leu Ala Gln Val Ala Ala Phe Leu Gly Asp
100 105 110

His Ala Ala Ser His Leu Val Tyr Gly Asp Val Val Met Arg Ser Thr
115 120 125

Lys Ser Arg His Ala Gly Pro Phe Asp Leu Asp Arg Leu Leu Phe Glu
130 135 140

Thr Asn Leu Cys His Gln Ser Ile Phe Tyr Arg Arg Glu Leu Phe Asp
145 150 155 160

Gly Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Trp Ala Asp Trp Asp
165 170 175

Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Ile Thr Arg Tyr Met
180 185 190

Asp Val Val Ile Ser Glu Tyr Asn Asp Met Thr Gly Phe Ser Met Arg
195 200 205

Gln Gly Thr Asp Lys Glu Phe Arg Lys Arg Leu Pro Met Tyr Phe Trp
210 215 220

Val Ala Gly Trp Glu Thr Cys Arg Arg Met Leu Ala Phe Leu Lys Asp
225 230 235 240

Lys Glu Asn Arg Arg Leu Ala Leu Arg Thr Arg Leu Ile Arg Val Lys
245 250 255

Ala Val Ser Lys Glu Arg Ser Ala Glu Pro
260 265

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..798



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG ACT GCG CCA GTG TTC TCG ATA ATT ATC CCT ACC TTC AAT GCA GCG 48
Met Thr Ala Pro Val Phe Ser Ile Ile Ile Pro Thr Phe Asn Ala Ala
270 275 280

GTG ACG CTG CAA GCC TGC CTC GGA AGC ATC GTC GGG CAG ACC TAC CGG 96
Val Thr Leu Gln Ala Cys Leu Gly Ser Ile Val Gly Gln Thr Tyr Arg
285 290 295

GAA GTG GAA GTG GTC CTT GTC GAC GGC GGT TCG ACC GAT CGG ACC CTC 144
Glu Val Glu Val Val Leu Val Asp Gly Gly Ser Thr Asp Arg Thr Leu
300 305 310

GAC ATC GCG AAC AGT TTC CGC CCG GAA CTC GGC TCG CGA CTG GTC GTT 192
Asp Ile Ala Asn Ser Phe Arg Pro Glu Leu Gly Ser Arg Leu Val Val
315 320 325 330

CAC AGC GGG CCC GAT GAT GGC CCC TAC GAC GCC ATG AAC CGC GGC GTC 240
His Ser Gly Pro Asp Asp Gly Pro Tyr Asp Ala Met Asn Arg Gly Val
335 340 345

GGC GTA GCC ACA GGC GAA TGG GTA CTT TTT TTA GGC GCC GAC GAC ACC 288
Gly Val Ala Thr Gly Glu Trp Val Leu Phe Leu Gly Ala Asp Asp Thr
350 355 360

CTC TAC GAA CCA ACC ACG TTG GCC CAG GTA GCC GCT TTT CTC GGC GAC 336
Leu Tyr Glu Pro Thr Thr Leu Ala Gln Val Ala Ala Phe Leu Gly Asp
365 370 375

CAT GCG GCA AGC CAT CTT GTC TAT GGC GAT GTT GTG ATG CGT TCG ACG 384
His Ala Ala Ser His Leu Val Tyr Gly Asp Val Val Met Arg Ser Thr
380 385 390

AAA AGC CGG CAT GCC GGA CCT TTC GAC CTC GAC CGC CTC CTA TTT GAG 432
Lys Ser Arg His Ala Gly Pro Phe Asp Leu Asp Arg Leu Leu Phe Glu
395 400 405 410

ACG AAT TTG TGC CAC CAA TCG ATC TTT TAC CGC CGT GAG CTT TTC GAC 480
Thr Asn Leu Cys His Gln Ser Ile Phe Tyr Arg Arg Glu Leu Phe Asp
415 420 425

GGC ATC GGC CCT TAC AAC CTG CGC TAC CGA GTC TGG GCG GAC TGG GAC 528
Gly Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Trp Ala Asp Trp Asp
430 435 440

TTC AAT ATT CGC TGC TTC TCC AAC CCG GCG CTG ATT ACC CGC TAC ATG 576
Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Ile Thr Arg Tyr Met
445 450 455

GAC GTC GTG ATT TCC GAA TAC AAC GAC ATG ACC GGC TTC AGC ATG AGG 624
Asp Val Val Ile Ser Glu Tyr Asn Asp Met Thr Gly Phe Ser Met Arg
460 465 470

CAG GGG ACT GAT AAA GAG TTC AGA AAA CGG CTG CCA ATG TAC TTC TGG 672
Gln Gly Thr Asp Lys Glu Phe Arg Lys Arg Leu Pro Met Tyr Phe Trp
475 480 485 490

GTT GCA GGG TGG GAG ACT TGC AGG CGC ATG CTG GCG TTT TTG AAA GAC 720
Val Ala Gly Trp Glu Thr Cys Arg Arg Met Leu Ala Phe Leu Lys Asp
495 500 505

AAG GAG AAT CGC CGT CTG GCC TTG CGT ACG CGG TTG ATA AGG GTT AAG 768
Lys Glu Asn Arg Arg Leu Ala Leu Arg Thr Arg Leu Ile Arg Val Lys
510 515 520

GCC GTC TCC AAA GAA CGA AGC GCA GAA CCG TAG 801
Ala Val Ser Lys Glu Arg Ser Ala Glu Pro
525 530



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Thr Ala Pro Val Phe Ser Ile Ile Ile Pro Thr Phe Asn Ala Ala
1 5 10 15

Val Thr Leu Gln Ala Cys Leu Gly Ser Ile Val Gly Gln Thr Tyr Arg
20 25 30

Glu Val Glu Val Val Leu Val Asp Gly Gly Ser Thr Asp Arg Thr Leu
35 40 45

Asp Ile Ala Asn Ser Phe Arg Pro Glu Leu Gly Ser Arg Leu Val Val
50 55 60

His Ser Gly Pro Asp Asp Gly Pro Tyr Asp Ala Met Asn Arg Gly Val
65 70 75 80

Gly Val Ala Thr Gly Glu Trp Val Leu Phe Leu Gly Ala Asp Asp Thr
85 90 95

Leu Tyr Glu Pro Thr Thr Leu Ala Gln Val Ala Ala Phe Leu Gly Asp
100 105 110

His Ala Ala Ser His Leu Val Tyr Gly Asp Val Val Met Arg Ser Thr
115 120 125

Lys Ser Arg His Ala Gly Pro Phe Asp Leu Asp Arg Leu Leu Phe Glu
130 135 140

Thr Asn Leu Cys His Gln Ser Ile Phe Tyr Arg Arg Glu Leu Phe Asp
145 150 155 160

Gly Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Trp Ala Asp Trp Asp
165 170 175

Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Ile Thr Arg Tyr Met
180 185 190

Asp Val Val Ile Ser Glu Tyr Asn Asp Met Thr Gly Phe Ser Met Arg
195 200 205

Gln Gly Thr Asp Lys Glu Phe Arg Lys Arg Leu Pro Met Tyr Phe Trp
210 215 220

Val Ala Gly Trp Glu Thr Cys Arg Arg Met Leu Ala Phe Leu Lys Asp
225 230 235 240

Lys Glu Asn Arg Arg Leu Ala Leu Arg Thr Arg Leu Ile Arg Val Lys
245 250 255

Ala Val Ser Lys Glu Arg Ser Ala Glu Pro
260 265

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..864



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



GTG GCC AGC AGA AGT CCC CAC TCC GCT GCG GGT GGT TGG CTA ATT CTT 48
Val Ala Ser Arg Ser Pro His Ser Ala Ala Gly Gly Trp Leu Ile Leu
270 275 280

GGC GGC TCC CTT CTT GTG GTC GGC GTG GCG CAT CCG GTA GGA CTC GCC 96
Gly Gly Ser Leu Leu Val Val Gly Val Ala His Pro Val Gly Leu Ala
285 290 295

GGA GGT GAC GAC GAT GCT GGC GTG GTG CAG CAG CCG ATC GAG GAT GCT 144
Gly Gly Asp Asp Asp Ala Gly Val Val Gln Gln Pro Ile Glu Asp Ala
300 305 310

GGC GGC GGT GGT GTG CTC GGG CAG GAA TCG CCC CCA TTG TTC GAA GGG 192
Gly Gly Gly Gly Val Leu Gly Gln Glu Ser Pro Pro Leu Phe Glu Gly
315 320 325 330

CCA ATG CGA GGC GAT GGC CAG GGA GCG GCG CTC GTA GCC GGC AGC CAC 240
Pro Met Arg Gly Asp Gly Gln Gly Ala Ala Leu Val Ala Gly Ser His
335 340 345

GAG CCG GAA CAA CAG TTG AGT CCC GGT GTC GTC GAG CGG GGC GAA GCC 288
Glu Pro Glu Gln Gln Leu Ser Pro Gly Val Val Glu Arg Gly Glu Ala
350 355 360

GAT CTC GTC CAA GAT GAC CAG ATC CGC GCG GAG CAG GGT GTC GAT GAT 336
Asp Leu Val Gln Asp Asp Gln Ile Arg Ala Glu Gln Gly Val Asp Asp
365 370 375

CTT GCC GAC GGT GTT GTC GGC CAG GCC GCG GTA GAG GAC CTC GAT CAG 384
Leu Ala Asp Gly Val Val Gly Gln Ala Ala Val Glu Asp Leu Asp Gln
380 385 390

GTC GGC GGC GGT GAA GTA GCG GAC TTT GAA TCC GGC GTG GAC GGC AGC 432
Val Gly Gly Gly Glu Val Ala Asp Phe Glu Ser Gly Val Asp Gly Ser
395 400 405 410

GTG CCC GCA GCC GAT GAG CAG GTG ACT TTT GCC CGT ACC AGG TGG GCC 480
Val Pro Ala Ala Asp Glu Gln Val Thr Phe Ala Arg Thr Arg Trp Ala
415 420 425

AAT GAC CGC CAG GTT CTG TTG TGC CCG AAT CCA TTC CAG GCT CGA CAG 528
Asn Asp Arg Gln Val Leu Leu Cys Pro Asn Pro Phe Gln Ala Arg Gln
430 435 440

GTA GTC GAA CGT GGC TGC GGT GAT CGA CGA TCC GGT GAC GTC GAA CCC 576
Val Val Glu Arg Gly Cys Gly Asp Arg Arg Ser Gly Asp Val Glu Pro
445 450 455

GTC GAG GGT CTT GGT GAC CGG GAA GGC TGC GGC CTT GAG ACG GTT GGC 624
Val Glu Gly Leu Gly Asp Arg Glu Gly Cys Gly Leu Glu Thr Val Gly
460 465 470

GGT GTT GGA GGC ATC GCG GGC AGC GAT CTC GGC CTC AAC CAA CGT CCG 672
Gly Val Gly Gly Ile Ala Gly Ser Asp Leu Gly Leu Asn Gln Arg Pro
475 480 485 490

CAG GAT CTC CTC CGG TGT CCA GCG TTG CGT CTT GGC GAC TTG CAA CAC 720
Gln Asp Leu Leu Arg Cys Pro Ala Leu Arg Leu Gly Asp Leu Gln His
495 500 505

CTC GGC GGC GTT GCG GCG CAC CGT GGC CAG CTT CAA CCG CCG CAG CGC 768
Leu Gly Gly Val Ala Ala His Arg Gly Gln Leu Gln Pro Pro Gln Arg
510 515 520

CGC GTC AAG GTC AGC AGC CAG CGG TGC CGC CGA GGA CGG TGC CAC CGG 816
Arg Val Lys Val Ser Ser Gln Arg Cys Arg Arg Gly Arg Cys His Arg
525 530 535

CTT GGC AGC GGT GGT CAT GAG GCC GTC CCG TCG GTG GTG TTG ATC TTG 864
Leu Gly Ser Gly Gly His Glu Ala Val Pro Ser Val Val Leu Ile Leu
540 545 550

TAG 867



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Val Ala Ser Arg Ser Pro His Ser Ala Ala Gly Gly Trp Leu Ile Leu
1 5 10 15

Gly Gly Ser Leu Leu Val Val Gly Val Ala His Pro Val Gly Leu Ala
20 25 30

Gly Gly Asp Asp Asp Ala Gly Val Val Gln Gln Pro Ile Glu Asp Ala
35 40 45

Gly Gly Gly Gly Val Leu Gly Gln Glu Ser Pro Pro Leu Phe Glu Gly
50 55 60

Pro Met Arg Gly Asp Gly Gln Gly Ala Ala Leu Val Ala Gly Ser His
65 70 75 80

Glu Pro Glu Gln Gln Leu Ser Pro Gly Val Val Glu Arg Gly Glu Ala
85 90 95

Asp Leu Val Gln Asp Asp Gln Ile Arg Ala Glu Gln Gly Val Asp Asp
100 105 110

Leu Ala Asp Gly Val Val Gly Gln Ala Ala Val Glu Asp Leu Asp Gln
115 120 125

Val Gly Gly Gly Glu Val Ala Asp Phe Glu Ser Gly Val Asp Gly Ser
130 135 140

Val Pro Ala Ala Asp Glu Gln Val Thr Phe Ala Arg Thr Arg Trp Ala
145 150 155 160

Asn Asp Arg Gln Val Leu Leu Cys Pro Asn Pro Phe Gln Ala Arg Gln
165 170 175

Val Val Glu Arg Gly Cys Gly Asp Arg Arg Ser Gly Asp Val Glu Pro
180 185 190

Val Glu Gly Leu Gly Asp Arg Glu Gly Cys Gly Leu Glu Thr Val Gly
195 200 205

Gly Val Gly Gly Ile Ala Gly Ser Asp Leu Gly Leu Asn Gln Arg Pro
210 215 220

Gln Asp Leu Leu Arg Cys Pro Ala Leu Arg Leu Gly Asp Leu Gln His
225 230 235 240

Leu Gly Gly Val Ala Ala His Arg Gly Gln Leu Gln Pro Pro Gln Arg
245 250 255

Arg Val Lys Val Ser Ser Gln Arg Cys Arg Arg Gly Arg Cys His Arg
260 265 270

Leu Gly Ser Gly Gly His Glu Ala Val Pro Ser Val Val Leu Ile Leu
275 280 285



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1739 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..945

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 945..1736



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATGGGCTGCC TCAAAGGTGG TGTCGTCGCC AATGTTGTTG TTCCAACACC GGATTATGTG 60

CGATTCGCGT CCCACTATGG CTTCGTTCCG GACTTCTGCC ACGGTGCGGA TCCGCAATCG 120

AAGGGCATCG TGGAGAACCT CTGTGGCTAC GCTCAGGACG ACCTTGCGGT GCCGCTGCTG 180

ACCGAAGCTG CGTTAGCCGG TGAGCAGGTC GACCTACGTG CCCTCAACGC CCAGGCGCAA 240

CTATGGTGCG CCGAGGTCAA TGCCACGGTC CACTCGGAGA TCTGCGCCGT GCCCAACGAT 300

CGCTTGGTTG ACGAGCGCAC CGTCTTGAGG GAGCTGCCCT CGCTGCGGCC GACGATCGGC 360

TCGGGGTCGG TGCGCCGTAA GGTCGACGGC CTCTCGTGCA TCCGTTACGG CTCAGCTCGT 420

TACTCGGTGC CTCAGCGGCT CGTCGGTGCC ACCGTGGCGG TGGTGGTCGA TCATGGCGCC 480

CTGATCCTGT TGGAACCTGC GACCGGTGTG ATCGTGGCCG AGCACGAGCT CGTCAGCCCA 540

GGTGAGGTGT CCATCCTCGA TGAACACTAC GACGGACCCA GACCCGCACC CTCGCGTGGT 600

CCTCGCCCGA AAACCCAAGC AGAGAAACGA TTCTGCGCAT TGGGAACCGA AGCGCAGCAG 660

TTCCTCGTCG GTGCTGCTGC GATCGGCAAC ACCCGACTGA AATCCGAACT CGACATTCTG 720

CTCGGCCTTG GCGCCGCCCA CGGCGAACAG GCTTTGATTG ACGCGCTGCG CCGGGCGGTT 780

GCGTTTCGCC GGTTCCGCGC TGCCGACGTG CGCTCGATCC TGGCCGCCGG CGCCGGCACC 840

CCACAACCCC GCCCCGCCGG CGACGCACTC GTGCTCGATC TGCCCACCGT CGAGACCCGC 900

TCGTTGGAGG CCTACAAGAT CAACACCACC GACGGGACGG CCTCATGACC ACCGCTGCCA 960

AGCCGGTGGC ACCGTCCTCG GCGGCACCGC TGGCTGCTGA CCTTGACGCG GCGCTGCGGC 1020

GGTTGAAGCT GGCCACGGTG CGCCGCAACG CCGCCGAGGT GTTGCAAGTC GCCAAGACGC 1080

AACGCTGGAC ACCGGAGGAG ATCCTGCGGA CGTTGGTTGA GGCCGAGATC GCTGCCCGCG 1140

ATGCCTCCAA CACCGCCAAC CGTCTCAAGG CCGCAGCCTT CCCGGTCACC AAGACCCTCG 1200

ACGGGTTCGA CGTCACCGGA TCGTCGATCA CCGCAGCCAC GTTCGACTAC CTGTCGAGCC 1260

TGGAATGGAT TCGGGCACAA CAGAACCTGG CGGTCATTGG CCCACCTGGT ACGGGCAAAA 1320

GTCACCTGCT CATCGGCTGC GGGCACGCTG CCGTCCACGC CGGATTCAAA GTCCGCTACT 1380

TCACCGCCGC CGACCTGATC GAGGTCCTCT ACCGCGGCCT GGCCGACAAC ACCGTCGGCA 1440

AGATCATCGA CACCCTGCTC CGCGCGGATC TGGTCATCTT GGACGAGATC GGCTTCGCCC 1500





CGCTCGACGA CACCGGGACT CAACTGTTGT TCCGGCTCGT GGCTGCCGGC TACGAGCGCC 1560 

GCTCCCTGGC CATCGCCTCG CATTGGCCCT TCGAACAATG GGGGCGATTC CTGCCCGAGC 1620 

ACACCACCGC CGCCAGCATC CTCGATCGGC TGCTGCACCA CGCCAGCATC GTCGTCACCT 1680 

CCGGCGAGTC CTACCGGATG CGCCACGCCG ACCACAAGAA GGGAGCCGCC AAGAATTAG 1739 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Gly Cys Leu Lys Gly Gly Val Val Ala Asn Val Val Val Pro Thr
1 5 10 15

Pro Asp Tyr Val Arg Phe Ala Ser His Tyr Gly Phe Val Pro Asp Phe
20 25 30

Cys His Gly Ala Asp Pro Gln Ser Lys Gly Ile Val Glu Asn Leu Cys
35 40 45

Gly Tyr Ala Gln Asp Asp Leu Ala Val Pro Leu Leu Thr Glu Ala Ala
50 55 60

Leu Ala Gly Glu Gln Val Asp Leu Arg Ala Leu Asn Ala Gln Ala Gln
65 70 75 80

Leu Trp Cys Ala Glu Val Asn Ala Thr Val His Ser Glu Ile Cys Ala
85 90 95

Val Pro Asn Asp Arg Leu Val Asp Glu Arg Thr Val Leu Arg Glu Leu
100 105 110

Pro Ser Leu Arg Pro Thr Ile Gly Ser Gly Ser Val Arg Arg Lys Val
115 120 125

Asp Gly Leu Ser Cys Ile Arg Tyr Gly Ser Ala Arg Tyr Ser Val Pro
130 135 140

Gln Arg Leu Val Gly Ala Thr Val Ala Val Val Val Asp His Gly Ala
145 150 155 160

Leu Ile Leu Leu Glu Pro Ala Thr Gly Val Ile Val Ala Glu His Glu
165 170 175

Leu Val Ser Pro Gly Glu Val Ser Ile Leu Asp Glu His Tyr Asp Gly
180 185 190

Pro Arg Pro Ala Pro Ser Arg Gly Pro Arg Pro Lys Thr Gln Ala Glu
195 200 205

Lys Arg Phe Cys Ala Leu Gly Thr Glu Ala Gln Gln Phe Leu Val Gly
210 215 220

Ala Ala Ala Ile Gly Asn Thr Arg Leu Lys Ser Glu Leu Asp Ile Leu
225 230 235 240

Leu Gly Leu Gly Ala Ala His Gly Glu Gln Ala Leu Ile Asp Ala Leu
245 250 255

Arg Arg Ala Val Ala Phe Arg Arg Phe Arg Ala Ala Asp Val Arg Ser
260 265 270

Ile Leu Ala Ala Gly Ala Gly Thr Pro Gln Pro Arg Pro Ala Gly Asp
275 280 285

Ala Leu Val Leu Asp Leu Pro Thr Val Glu Thr Arg Ser Leu Glu Ala
290 295 300

Tyr Lys Ile Asn Thr Thr Asp Gly Thr Ala Ser
305 310 315



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Thr Thr Ala Ala Lys Pro Val Ala Pro Ser Ser Ala Ala Pro Leu
1 5 10 15

Ala Ala Asp Leu Asp Ala Ala Leu Arg Arg Leu Lys Leu Ala Thr Val
20 25 30

Arg Arg Asn Ala Ala Glu Val Leu Gln Val Ala Lys Thr Gln Arg Trp
35 40 45

Thr Pro Glu Glu Ile Leu Arg Thr Leu Val Glu Ala Glu Ile Ala Ala
50 55 60

Arg Asp Ala Ser Asn Thr Ala Asn Arg Leu Lys Ala Ala Ala Phe Pro
65 70 75 80

Val Thr Lys Thr Leu Asp Gly Phe Asp Val Thr Gly Ser Ser Ile Thr
85 90 95

Ala Ala Thr Phe Asp Tyr Leu Ser Ser Leu Glu Trp Ile Arg Ala Gln
100 105 110

Gln Asn Leu Ala Val Ile Gly Pro Pro Gly Thr Gly Lys Ser His Leu
115 120 125

Leu Ile Gly Cys Gly His Ala Ala Val His Ala Gly Phe Lys Val Arg
130 135 140

Tyr Phe Thr Ala Ala Asp Leu Ile Glu Val Leu Tyr Arg Gly Leu Ala
145 150 155 160

Asp Asn Thr Val Gly Lys Ile Ile Asp Thr Leu Leu Arg Ala Asp Leu
165 170 175

Val Ile Leu Asp Glu Ile Gly Phe Ala Pro Leu Asp Asp Thr Gly Thr
180 185 190

Gln Leu Leu Phe Arg Leu Val Ala Ala Gly Tyr Glu Arg Arg Ser Leu
195 200 205

Ala Ile Ala Ser His Trp Pro Phe Glu Gln Trp Gly Arg Phe Leu Pro
210 215 220

Glu His Thr Thr Ala Ala Ser Ile Leu Asp Arg Leu Leu His His Ala
225 230 235 240

Ser Ile Val Val Thr Ser Gly Glu Ser Tyr Arg Met Arg His Ala Asp
245 250 255

His Lys Lys Gly Ala Ala Lys Asn
260



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 789 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GTGACGTCTG CTCCGACCGT CTCGGTGATA ACGATCTCGT TCAACGACCT CGACGGGTTG 60
CAGCGCACGG TGAAAAGTGT GCGGGCGCAA CGCTACCGGG GACGCATCGA GCACATCGTA 120
ATCGACGGTG GCAGCGGCGA CGACGTGGTG GCATACCTGT CCGGGTGTGA ACCAGGCTTC 180
GCGTATTGGC AGTCCGAGCC CGACGGCGGG CGGTACGACG CGATGAACCA GGGCATCGCG 240
CACGCATCGG GTGATCTGTT GTGGTTCTTG CACTCCGCCG ATCGTTTTTC CGGGCCCGAC 300
GTGGTAGCCC AGGCCGTGGA GGCGCTATCC GGCAAGGGAC CGGTGTCCGA ATTGTGGGGC 360
TTCGGGATGG ATCGTCTCGT CGGGCTCGAT CGGGTGCGCG GCCCGATACC TTTCAGCCTG 420
CGCAAATTCC TGGCCGGCAA GCAGGTTGTT CCGCATCAAG CATCGTTCTT CGGATCATCG 480
CTGGTGGCCA AGATCGGTGG CTACGACCTT GATTTCGGGA TCGCCGCCGA CCAGGAATTC 540
ATATTGCGGG CCGCGCTGGT ATGCGAGCCG GTCACGATTC GGTGTGTGCT GTGCGAGTTC 600
GACACCACGG GCGTCGGCTC GCACCGGGAA CCAAGCGCGG TCTTCGGTGA TCTGCGCCGC 660
ATGGGCGACC TTCATCGCCG CTACCCGTTC GGGGGAAGGC GAATATCACA TGCCTACCTA 720
CGCGGCCGGG AGTTCTACGC CTACAACAGT CGATTCTGGG AAAACGTCTT CACGCGAATG 780
TCGAAATAG 789

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 262 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



Met Thr Ser Ala Pro Thr Val Ser Val Ile Thr Ile Ser Phe Asn Asp
1 5 10 15

Leu Asp Gly Leu Gln Arg Thr Val Lys Ser Val Arg Ala Gln Arg Tyr
20 25 30

Arg Gly Arg Ile Glu His Ile Val Ile Asp Gly Gly Ser Gly Asp Asp
35 40 45

Val Val Ala Tyr Leu Ser Gly Cys Glu Pro Gly Phe Ala Tyr Trp Gln
50 55 60

Ser Glu Pro Asp Gly Gly Arg Tyr Asp Ala Met Asn Gln Gly Ile Ala
65 70 75 80

His Ala Ser Gly Asp Leu Leu Trp Phe Leu His Ser Ala Asp Arg Phe
85 90 95

Ser Gly Pro Asp Val Val Ala Gln Ala Val Glu Ala Leu Ser Gly Lys
100 105 110

Gly Pro Val Ser Glu Leu Trp Gly Phe Gly Met Asp Arg Leu Val Gly
115 120 125

Leu Asp Arg Val Arg Gly Pro Ile Pro Phe Ser Leu Arg Lys Phe Leu
130 135 140

Ala Gly Lys Gln Val Val Pro His Gln Ala Ser Phe Phe Gly Ser Ser
145 150 155 160

Leu Val Ala Lys Ile Gly Gly Tyr Asp Leu Asp Phe Gly Ile Ala Ala
165 170 175

Asp Gln Glu Phe Ile Leu Arg Ala Ala Leu Val Cys Glu Pro Val Thr
180 185 190

Ile Arg Cys Val Leu Cys Glu Phe Asp Thr Thr Gly Val Gly Ser His
195 200 205

Arg Glu Pro Ser Ala Val Phe Gly Asp Leu Arg Arg Met Gly Asp Leu
210 215 220

His Arg Arg Tyr Pro Phe Gly Gly Arg Arg Ile Ser His Ala Tyr Leu
225 230 235 240

Arg Gly Arg Glu Phe Tyr Ala Tyr Asn Ser Arg Phe Trp Glu Asn Val
245 250 255

Phe Thr Arg Met Ser Lys
260



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTGAAGCGAG CGCTCATCAC CGGAATCACC GGCCAGGACG GCTCGTATCT CGCCGAACTG 60
CTGCTGGCCA AGGGGTATGA GGTTCACGGG CTCATCCGGC GCGCTTCGAC GTTCAACACC 120
TCGCGGATCG ATCACCTCTA CGTCGACCCG CACCAACCGG GCGCGCGGCT GTTTCTGCAC 180
TATGGTGACC TGATCGACGG AACCCGGTTG GTGACCCTGC TGAGCACCAT CGAACCCGAC 240
GAGGTGTACA ACCTGGCGGC GCAGTCACAC GTGCGGGTGA GCTTCGACGA ACCCGTGCAC 300
ACCGGTGACA CCACCGGCAT GGGATCCATG CGACTGCTGG AAGCCGTTCG GCTCTCTCGG 360
GTGCACTGCC GCTTCTATCA GGCGTCCTCG TCGGAGATGT TCGGCGCCTC GCCGCCACCG 420
CAGAACGAGC TGACGCCGTT CTACCCGCGG TCACCGTATG GCGCCGCCAA GGTCTATTCG 480
TACTGGGCGA CCCGCAATTA TCGCGAAGCG TACGGATTGT TCGCCGTTAA CGGCATCTTG 540
TTCAATCACG AATCACCGCG GCGCGGTGAG ACGTTCGTGA CCCGAAAGAT CACCAGGGCC 600
GTGGCACGCA TCAAGGCCGG TATCCAGTCC GAGGTCTATA TGGGCAATCT GGATGCGGTC 660
CGCGACTGGG GGTACGCGCC CGAATACGTC GAAGGCATGT GGCGGATGCT GCAGACCGAC 720
GAGCCCGACG ACTTCGTTTT GGCGACCGGG CGCGGTTTCA CCGTGCGTGA GTTCGCGCGG 780
GCCGCGTTCG AGCATGCCGG TTTGGACTGG CAGCAGTACG TGAAATTCGA CCAACGCTAT 840
CTGCGGCCCA CCGAGGTGGA TTCGCTGATC GGCGACGCGA CCAAGGCTGC CGAATTGCTG 900
GGCTGGAGGG CTTCGGTGCA CACTGACGAG TTGGCTCGGA TCATGGTCGA CGCGGACATG 960



GCGGCGCTGG AGTGCGAAGG CAAGCCGTGG ATCGACAAGC CGATGATCGC CGGCCGGACA 1020 
TGA 1023 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Lys Arg Ala Leu Ile Thr Gly Ile Thr Gly Gln Asp Gly Ser Tyr
1 5 10 15

Leu Ala Glu Leu Leu Leu Ala Lys Gly Tyr Glu Val His Gly Leu Ile
20 25 30

Arg Arg Ala Ser Thr Phe Asn Thr Ser Arg Ile Asp His Leu Tyr Val
35 40 45

Asp Pro His Gln Pro Gly Ala Arg Leu Phe Leu His Tyr Gly Asp Leu
50 55 60

Ile Asp Gly Thr Arg Leu Val Thr Leu Leu Ser Thr Ile Glu Pro Asp
65 70 75 80

Glu Val Tyr Asn Leu Ala Ala Gln Ser His Val Arg Val Ser Phe Asp
85 90 95

Glu Pro Val His Thr Gly Asp Thr Thr Gly Met Gly Ser Met Arg Leu
100 105 110

Leu Glu Ala Val Arg Leu Ser Arg Val His Cys Arg Phe Tyr Gln Ala
115 120 125

Ser Ser Ser Glu Met Phe Gly Ala Ser Pro Pro Pro Gln Asn Glu Leu
130 135 140

Thr Pro Phe Tyr Pro Arg Ser Pro Tyr Gly Ala Ala Lys Val Tyr Ser
145 150 155 160

Tyr Trp Ala Thr Arg Asn Tyr Arg Glu Ala Tyr Gly Leu Phe Ala Val
165 170 175

Asn Gly Ile Leu Phe Asn His Glu Ser Pro Arg Arg Gly Glu Thr Phe
180 185 190

Val Thr Arg Lys Ile Thr Arg Ala Val Ala Arg Ile Lys Ala Gly Ile
195 200 205

Gln Ser Glu Val Tyr Met Gly Asn Leu Asp Ala Val Arg Asp Trp Gly
210 215 220

Tyr Ala Pro Glu Tyr Val Glu Gly Met Trp Arg Met Leu Gln Thr Asp
225 230 235 240

Glu Pro Asp Asp Phe Val Leu Ala Thr Gly Arg Gly Phe Thr Val Arg
245 250 255

Glu Phe Ala Arg Ala Ala Phe Glu His Ala Gly Leu Asp Trp Gln Gln
260 265 270

Tyr Val Lys Phe Asp Gln Arg Tyr Leu Arg Pro Thr Glu Val Asp Ser
275 280 285

Leu Ile Gly Asp Ala Thr Lys Ala Ala Glu Leu Leu Gly Trp Arg Ala
290 295 300

Ser Val His Thr Asp Glu Leu Ala Arg Ile Met Val Asp Ala Asp Met
305 310 315 320

Ala Ala Leu Glu Cys Glu Gly Lys Pro Trp Ile Asp Lys Pro Met Ile
325 330 335

Ala Gly Arg Thr
340

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..729



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATG AGG CTG GCC CGT CGC GCT CGG AAC ATC TTG CGT CGC AAC GGC ATC 48
Met Arg Leu Ala Arg Arg Ala Arg Asn Ile Leu Arg Arg Asn Gly Ile
345 350 355

GAG GTG TCG CGC TAC TTT GCC GAA CTG GAC TGG GAA CGC AAT TTC TTG 96
Glu Val Ser Arg Tyr Phe Ala Glu Leu Asp Trp Glu Arg Asn Phe Leu
360 365 370

CGC CAA CTG CAA TCG CAT CGG GTC AGT GCC GTG CTC GAT GTC GGG GCC 144
Arg Gln Leu Gln Ser His Arg Val Ser Ala Val Leu Asp Val Gly Ala
375 380 385

AAT TCG GGG CAG TAC GCC AGG GGT CTG CGC GGC GCG GGC TTC GCG GGC 192
Asn Ser Gly Gln Tyr Ala Arg Gly Leu Arg Gly Ala Gly Phe Ala Gly
390 395 400

CGC ATC GTC TCG TTC GAG CCG CTG CCC GGG CCC TTT GCC GTC TTG CAG 240
Arg Ile Val Ser Phe Glu Pro Leu Pro Gly Pro Phe Ala Val Leu Gln
405 410 415 420

CGC AGC GCC TCC ACG GAC CCG TTG TGG GAA TGC CGG CGC TGT GCG CTG 288
Arg Ser Ala Ser Thr Asp Pro Leu Trp Glu Cys Arg Arg Cys Ala Leu
425 430 435

GGC GAT GTC GAT GGA ACC ATC TCG ATC AAC GTC GCC GGC AAC GAG GGC 336
Gly Asp Val Asp Gly Thr Ile Ser Ile Asn Val Ala Gly Asn Glu Gly
440 445 450

GCC AGC AGT TCC GTC TTG CCG ATG TTG AAA CGA CAT CAG GAC GCC TTT 384
Ala Ser Ser Ser Val Leu Pro Met Leu Lys Arg His Gln Asp Ala Phe
455 460 465

CCA CCA GCC AAC TAC GTG GGC GCC CAA CGG GTG CCG ATA CAT CGA CTC 432
Pro Pro Ala Asn Tyr Val Gly Ala Gln Arg Val Pro Ile His Arg Leu
470 475 480

GAT TCC GTG GCT GCA GAC GTT CTG CGG CCC AAC GAT ATT GCG TTC TTG 480
Asp Ser Val Ala Ala Asp Val Leu Arg Pro Asn Asp Ile Ala Phe Leu
485 490 495 500

AAG ATC GAC GTT CAA GGA TTC GAG AAG CAG GTG ATC GCG GGT GGC GAT 528
Lys Ile Asp Val Gln Gly Phe Glu Lys Gln Val Ile Ala Gly Gly Asp
505 510 515

TCA ACG GTG CAC GAC CGA TGC GTC GGC ATG CAG CTC GAG CTG TCT TTC 576
Ser Thr Val His Asp Arg Cys Val Gly Met Gln Leu Glu Leu Ser Phe
520 525 530

CAG CCG TTG TAC GAG GGT GGC ATG CTC ATC CGC GAG GCG CTC GAT CTC 624
Gln Pro Leu Tyr Glu Gly Gly Met Leu Ile Arg Glu Ala Leu Asp Leu
535 540 545

GTG GAT TCG TTG GGC TTT ACG CTC TCG GGA TTG CAA CCC GGT TTC ACC 672
Val Asp Ser Leu Gly Phe Thr Leu Ser Gly Leu Gln Pro Gly Phe Thr
550 555 560

GAC CCC CGC AAC GGT CGA ATG CTG CAG GCC GAT GGC ATC TTC TTC CGG 720
Asp Pro Arg Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg
565 570 575 580

GGC AGC GAT TGA 732
Gly Ser Asp



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Arg Leu Ala Arg Arg Ala Arg Asn Ile Leu Arg Arg Asn Gly Ile
1 5 10 15

Glu Val Ser Arg Tyr Phe Ala Glu Leu Asp Trp Glu Arg Asn Phe Leu
20 25 30

Arg Gln Leu Gln Ser His Arg Val Ser Ala Val Leu Asp Val Gly Ala
35 40 45

Asn Ser Gly Gln Tyr Ala Arg Gly Leu Arg Gly Ala Gly Phe Ala Gly
50 55 60

Arg Ile Val Ser Phe Glu Pro Leu Pro Gly Pro Phe Ala Val Leu Gln
65 70 75 80

Arg Ser Ala Ser Thr Asp Pro Leu Trp Glu Cys Arg Arg Cys Ala Leu
85 90 95

Gly Asp Val Asp Gly Thr Ile Ser Ile Asn Val Ala Gly Asn Glu Gly
100 105 110

Ala Ser Ser Ser Val Leu Pro Met Leu Lys Arg His Gln Asp Ala Phe
115 120 125

Pro Pro Ala Asn Tyr Val Gly Ala Gln Arg Val Pro Ile His Arg Leu
130 135 140

Asp Ser Val Ala Ala Asp Val Leu Arg Pro Asn Asp Ile Ala Phe Leu
145 150 155 160

Lys Ile Asp Val Gln Gly Phe Glu Lys Gln Val Ile Ala Gly Gly Asp
165 170 175

Ser Thr Val His Asp Arg Cys Val Gly Met Gln Leu Glu Leu Ser Phe
180 185 190

Gln Pro Leu Tyr Glu Gly Gly Met Leu Ile Arg Glu Ala Leu Asp Leu
195 200 205

Val Asp Ser Leu Gly Phe Thr Leu Ser Gly Leu Gln Pro Gly Phe Thr
210 215 220

Asp Pro Arg Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg
225 230 235 240

Gly Ser Asp



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..729



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GTG AAA TCG TTG AAA CTC GCT CGT TTC ATC GCG CGT AGC GCC GCC TTC 48
Val Lys Ser Leu Lys Leu Ala Arg Phe Ile Ala Arg Ser Ala Ala Phe
245 250 255

GAG GTT TCG CGC CGC TAT TCT GAG CGA GAC CTG AAG CAC CAG TTT GTG 96
Glu Val Ser Arg Arg Tyr Ser Glu Arg Asp Leu Lys His Gln Phe Val
260 265 270 275

AAG CAA CTC AAA TCG CGT CGG GTA GAT GTC GTT TTC GAT GTC GGC GCC 144
Lys Gln Leu Lys Ser Arg Arg Val Asp Val Val Phe Asp Val Gly Ala
280 285 290

AAC TCA GGA CAA TAC GCC GCC GGC CTC CGC CGA GCA GCA TAT AAG GGC 192
Asn Ser Gly Gln Tyr Ala Ala Gly Leu Arg Arg Ala Ala Tyr Lys Gly
295 300 305

CGC ATT GTC TCG TTC GAA CCG CTA TCC GGA CCG TTT ACG ATC TTG GAA 240
Arg Ile Val Ser Phe Glu Pro Leu Ser Gly Pro Phe Thr Ile Leu Glu
310 315 320

AGC AAA GCG TCA ACG GAT CCA CTT TGG GAT TGC CGG CAG CAT GCG TTG 288
Ser Lys Ala Ser Thr Asp Pro Leu Trp Asp Cys Arg Gln His Ala Leu
325 330 335

GGC GAT TCT GAT GGA ACG GTT ACG ATC AAT ATC GCA GGA AAC GCC GGT 336
Gly Asp Ser Asp Gly Thr Val Thr Ile Asn Ile Ala Gly Asn Ala Gly
340 345 350 355

CAG AGC AGT TCC GTC TTG CCC ATG CTG AAA AGT CAT CAG AAC GCT TTT 384
Gln Ser Ser Ser Val Leu Pro Met Leu Lys Ser His Gln Asn Ala Phe
360 365 370

CCC CCG GCA AAC TAT GTC GGT ACC CAA GAG GCG TCC ATA CAT CGA CTT 432
Pro Pro Ala Asn Tyr Val Gly Thr Gln Glu Ala Ser Ile His Arg Leu
375 380 385

GAT TCC GTG GCG CCA GAA TTT CTA GGC ATG AAC GGT GTC GCT TTT CTC 480
Asp Ser Val Ala Pro Glu Phe Leu Gly Met Asn Gly Val Ala Phe Leu
390 395 400

AAG GTC GAC GTT CAA GGC TTT GAA AAG CAG GTG CTC GCC GGG GGC AAA 528
Lys Val Asp Val Gln Gly Phe Glu Lys Gln Val Leu Ala Gly Gly Lys
405 410 415

TCA ACC ATA GAT GAC CAT TGC GTC GGC ATG CAA CTC GAA CTG TCC TTC 576
Ser Thr Ile Asp Asp His Cys Val Gly Met Gln Leu Glu Leu Ser Phe
420 425 430 435

CTG CCG TTG TAC GAA GGT GGC ATG CTC ATT CCT GAA GCC CTC GAT CTC 624
Leu Pro Leu Tyr Glu Gly Gly Met Leu Ile Pro Glu Ala Leu Asp Leu
440 445 450

GTG TAT TCC TTG GGC TTC ACG TTG ACG GGA TTG CTG CCT TGT TTC ATT 672
Val Tyr Ser Leu Gly Phe Thr Leu Thr Gly Leu Leu Pro Cys Phe Ile
455 460 465

GAT GCA AAT AAT GGT CGA ATG TTG CAG GCC GAC GGC ATC TTT TTC CGC 720
Asp Ala Asn Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg
470 475 480

GAG GAC GAT TGA 732
Glu Asp Asp
485



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Val Lys Ser Leu Lys Leu Ala Arg Phe Ile Ala Arg Ser Ala Ala Phe
1 5 10 15

Glu Val Ser Arg Arg Tyr Ser Glu Arg Asp Leu Lys His Gln Phe Val
20 25 30

Lys Gln Leu Lys Ser Arg Arg Val Asp Val Val Phe Asp Val Gly Ala
35 40 45

Asn Ser Gly Gln Tyr Ala Ala Gly Leu Arg Arg Ala Ala Tyr Lys Gly
50 55 60

Arg Ile Val Ser Phe Glu Pro Leu Ser Gly Pro Phe Thr Ile Leu Glu
65 70 75 80

Ser Lys Ala Ser Thr Asp Pro Leu Trp Asp Cys Arg Gln His Ala Leu
85 90 95

Gly Asp Ser Asp Gly Thr Val Thr Ile Asn Ile Ala Gly Asn Ala Gly
100 105 110

Gln Ser Ser Ser Val Leu Pro Met Leu Lys Ser His Gln Asn Ala Phe
115 120 125

Pro Pro Ala Asn Tyr Val Gly Thr Gln Glu Ala Ser Ile His Arg Leu
130 135 140

Asp Ser Val Ala Pro Glu Phe Leu Gly Met Asn Gly Val Ala Phe Leu
145 150 155 160

Lys Val Asp Val Gln Gly Phe Glu Lys Gln Val Leu Ala Gly Gly Lys
165 170 175

Ser Thr Ile Asp Asp His Cys Val Gly Met Gln Leu Glu Leu Ser Phe
180 185 190

Leu Pro Leu Tyr Glu Gly Gly Met Leu Ile Pro Glu Ala Leu Asp Leu
195 200 205

Val Tyr Ser Leu Gly Phe Thr Leu Thr Gly Leu Leu Pro Cys Phe Ile
210 215 220

Asp Ala Asn Asn Gly Arg Met Leu Gln Ala Asp Gly Ile Phe Phe Arg
225 230 235 240

Glu Asp Asp



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..825



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATG GTG CAG ACG AAA CGA TAC GCC GGC TTG ACC GCA GCT AAC ACA AAG 48
Met Val Gln Thr Lys Arg Tyr Ala Gly Leu Thr Ala Ala Asn Thr Lys
245 250 255

AAA GTC GCC ATG GCC GCA CCA ATG TTT TCG ATC ATC ATC CCC ACC TTG 96
Lys Val Ala Met Ala Ala Pro Met Phe Ser Ile Ile Ile Pro Thr Leu
260 265 270 275

AAC GTG GCT GCG GTA TTG CCT GCC TGC CTC GAC AGC ATC GCC CGT CAG 144
Asn Val Ala Ala Val Leu Pro Ala Cys Leu Asp Ser Ile Ala Arg Gln
280 285 290

ACC TGC GGT GAC TTC GAG CTG GTA CTG GTC GAC GGC GGC TCG ACG GAC 192
Thr Cys Gly Asp Phe Glu Leu Val Leu Val Asp Gly Gly Ser Thr Asp
295 300 305

GAA ACC CTC GAC ATC GCC AAC ATT TTC GCC CCC AAC CTC GGC GAG CGG 240
Glu Thr Leu Asp Ile Ala Asn Ile Phe Ala Pro Asn Leu Gly Glu Arg
310 315 320

TTG ATC ATT CAT CGC GAC ACC GAC CAG GGC GTC TAC GAC GCC ATG AAC 288
Leu Ile Ile His Arg Asp Thr Asp Gln Gly Val Tyr Asp Ala Met Asn
325 330 335

CGC GGC GTG GAC CTG GCC ACC GGA ACG TGG TTG CTC TTT CTG GGC GCG 336
Arg Gly Val Asp Leu Ala Thr Gly Thr Trp Leu Leu Phe Leu Gly Ala
340 345 350 355

GAC GAC AGC CTG TAC GAG GCT GAC ACC CTG GCG CGG GTG GCC GCC TTC 384
Asp Asp Ser Leu Tyr Glu Ala Asp Thr Leu Ala Arg Val Ala Ala Phe
360 365 370

ATT GGC GAA CAC GAG CCC AGC GAT CTG GTA TAT GGC GAC GTG ATC ATG 432
Ile Gly Glu His Glu Pro Ser Asp Leu Val Tyr Gly Asp Val Ile Met
375 380 385

CGC TCA ACC AAT TTC CGC TGG GGT GGC GCC TTC GAC CTC GAC CGT CTG 480
Arg Ser Thr Asn Phe Arg Trp Gly Gly Ala Phe Asp Leu Asp Arg Leu
390 395 400

TTG TTC AAG CGC AAC ATC TGC CAT CAG GCG ATC TTC TAC CGC CGC GGA 528
Leu Phe Lys Arg Asn Ile Cys His Gln Ala Ile Phe Tyr Arg Arg Gly
405 410 415

CTC TTC GGC ACC ATC GGT CCC TAC AAC CTC CGC TAC CGG GTC CTG GCC 576
Leu Phe Gly Thr Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Leu Ala
420 425 430 435

GAC TGG GAC TTC AAT ATT CGC TGC TTT TCC AAC CCA GCG CTC GTC ACC 624
Asp Trp Asp Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Val Thr
440 445 450

CGC TAC ATG CAC GTG GTC GTT GCA AGC TAC AAC GAA TTC GGC GGG CTC 672
Arg Tyr Met His Val Val Val Ala Ser Tyr Asn Glu Phe Gly Gly Leu
455 460 465

AGC AAT ACG ATC GTC GAC AAG GAG TTT TTG AAG CGG CTG CCG ATG TCC 720
Ser Asn Thr Ile Val Asp Lys Glu Phe Leu Lys Arg Leu Pro Met Ser
470 475 480

ACG AGA CTC GGC ATA AGG CTG GTC ATA GTT CTG GTG CGC AGG TGG CCA 768
Thr Arg Leu Gly Ile Arg Leu Val Ile Val Leu Val Arg Arg Trp Pro
485 490 495

AAG GTG ATC AGC AGG GCC ATG GTA ATG CGC ACC GTC ATT TCT TGG CGG 816
Lys Val Ile Ser Arg Ala Met Val Met Arg Thr Val Ile Ser Trp Arg
500 505 510 515

CGC CGA CGT TAG 828
Arg Arg Arg



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Val Gln Thr Lys Arg Tyr Ala Gly Leu Thr Ala Ala Asn Thr Lys
1 5 10 15

Lys Val Ala Met Ala Ala Pro Met Phe Ser Ile Ile Ile Pro Thr Leu
20 25 30

Asn Val Ala Ala Val Leu Pro Ala Cys Leu Asp Ser Ile Ala Arg Gln
35 40 45

Thr Cys Gly Asp Phe Glu Leu Val Leu Val Asp Gly Gly Ser Thr Asp
50 55 60

Glu Thr Leu Asp Ile Ala Asn Ile Phe Ala Pro Asn Leu Gly Glu Arg
65 70 75 80

Leu Ile Ile His Arg Asp Thr Asp Gln Gly Val Tyr Asp Ala Met Asn
85 90 95

Arg Gly Val Asp Leu Ala Thr Gly Thr Trp Leu Leu Phe Leu Gly Ala
100 105 110

Asp Asp Ser Leu Tyr Glu Ala Asp Thr Leu Ala Arg Val Ala Ala Phe
115 120 125

Ile Gly Glu His Glu Pro Ser Asp Leu Val Tyr Gly Asp Val Ile Met
130 135 140

Arg Ser Thr Asn Phe Arg Trp Gly Gly Ala Phe Asp Leu Asp Arg Leu
145 150 155 160

Leu Phe Lys Arg Asn Ile Cys His Gln Ala Ile Phe Tyr Arg Arg Gly
165 170 175

Leu Phe Gly Thr Ile Gly Pro Tyr Asn Leu Arg Tyr Arg Val Leu Ala
180 185 190

Asp Trp Asp Phe Asn Ile Arg Cys Phe Ser Asn Pro Ala Leu Val Thr
195 200 205

Arg Tyr Met His Val Val Val Ala Ser Tyr Asn Glu Phe Gly Gly Leu
210 215 220

Ser Asn Thr Ile Val Asp Lys Glu Phe Leu Lys Arg Leu Pro Met Ser
225 230 235 240

Thr Arg Leu Gly Ile Arg Leu Val Ile Val Leu Val Arg Arg Trp Pro
245 250 255

Lys Val Ile Ser Arg Ala Met Val Met Arg Thr Val Ile Ser Trp Arg
260 265 270

Arg Arg Arg
275

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



GATGCCGTGA GGAGGTAAAG CTGC 24



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GATACGGCTC TTGAATCCTG CACG 24 



